How to count tickets with status change filtered byYear-To-Date in Pandas?How do you change the size of figures drawn with matplotlib?How can I count the occurrences of a list item?How to change the order of DataFrame columns?How to drop rows of Pandas DataFrame whose value in a certain column is NaNChange data type of columns in PandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?How to deal with SettingWithCopyWarning in Pandas?How to count the NaN values in a column in pandas DataFrameHow to check if any value is NaN in a Pandas DataFrame

Proper way to shut down consumer

Is the mass of paint relevant in rocket design?

Could Apollo astronauts see city lights from the moon?

Is it impolite to ask for an in-flight catalogue with no intention of buying?

How to create fractional SI units (SI...sqrts)?

How to justify a team increase when the team is doing good?

Does wetting a beer glass change the foam characteristics?

How do you use the interjection for snorting?

What exactly did this mechanic sabotage on the American Airlines 737, and how dangerous was it?

what should be done first, handling missing data or dealing with data types?

How can this Stack Exchange site have an animated favicon?

Under what circumstances would RAM locations 0 and 1 be written and/or read on the C64?

How to clarify between imagined sensations and "real" fantasy events?

Designing a time thief proof safe

How to see the previous "Accessed" date in Windows

Does the Prepare Food ability from Cook's Utensils stack?

Do I have advantage with Riposte when moving away from a flanked enemy and triggering an opportunity attack?

Hangman Game (YAHG)

A famous scholar sent me an unpublished draft of hers. Then she died. I think her work should be published. What should I do?

Received a package but didn't order it

What is the meaning of word 'crack' in chapter 33 of A Game of Thrones?

Why did the Soviet Union not "grant" Inner Mongolia to Mongolia after World War Two?

Tesla coil and Tesla tower

How to say "cheat sheet" in French



How to count tickets with status change filtered byYear-To-Date in Pandas?


How do you change the size of figures drawn with matplotlib?How can I count the occurrences of a list item?How to change the order of DataFrame columns?How to drop rows of Pandas DataFrame whose value in a certain column is NaNChange data type of columns in PandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?How to deal with SettingWithCopyWarning in Pandas?How to count the NaN values in a column in pandas DataFrameHow to check if any value is NaN in a Pandas DataFrame






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;








0















I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



# this df1 (Snapshot 1)
Issue key Project name Status UpdatedDate
111 Proj1 Analysis 2019-03-18
222 Proj1 Open 2019-03-18

# this df2 (Snapshot 2)
Issue key Project name Status UpdatedDate
111 Proj1 Done 2019-03-28
222 Proj1 Open 2019-03-28


So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



How can I count number of ticket that are still open but without duplicates?










share|improve this question






























    0















    I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
    I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



    But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



    # this df1 (Snapshot 1)
    Issue key Project name Status UpdatedDate
    111 Proj1 Analysis 2019-03-18
    222 Proj1 Open 2019-03-18

    # this df2 (Snapshot 2)
    Issue key Project name Status UpdatedDate
    111 Proj1 Done 2019-03-28
    222 Proj1 Open 2019-03-28


    So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
    So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



    How can I count number of ticket that are still open but without duplicates?










    share|improve this question


























      0












      0








      0








      I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
      I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



      But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



      # this df1 (Snapshot 1)
      Issue key Project name Status UpdatedDate
      111 Proj1 Analysis 2019-03-18
      222 Proj1 Open 2019-03-18

      # this df2 (Snapshot 2)
      Issue key Project name Status UpdatedDate
      111 Proj1 Done 2019-03-28
      222 Proj1 Open 2019-03-28


      So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
      So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



      How can I count number of ticket that are still open but without duplicates?










      share|improve this question














      I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
      I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



      But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



      # this df1 (Snapshot 1)
      Issue key Project name Status UpdatedDate
      111 Proj1 Analysis 2019-03-18
      222 Proj1 Open 2019-03-18

      # this df2 (Snapshot 2)
      Issue key Project name Status UpdatedDate
      111 Proj1 Done 2019-03-28
      222 Proj1 Open 2019-03-28


      So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
      So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



      How can I count number of ticket that are still open but without duplicates?







      python pandas dataframe






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Mar 28 at 17:38









      bossangelobossangelo

      273 bronze badges




      273 bronze badges

























          1 Answer
          1






          active

          oldest

          votes


















          0
















          you can sort_values() and then drop_duplicates():



          pd.concat([df1, df2]) 
          .sort_values(['UpdatedDate'], ascending=[False])
          .drop_duplicates(['Issue key'], keep='first')
          .loc[lambda x: x.Status == 'Open']





          share|improve this answer



























            Your Answer






            StackExchange.ifUsing("editor", function ()
            StackExchange.using("externalEditor", function ()
            StackExchange.using("snippets", function ()
            StackExchange.snippets.init();
            );
            );
            , "code-snippets");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "1"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );














            draft saved

            draft discarded
















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55403808%2fhow-to-count-tickets-with-status-change-filtered-byyear-to-date-in-pandas%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            0
















            you can sort_values() and then drop_duplicates():



            pd.concat([df1, df2]) 
            .sort_values(['UpdatedDate'], ascending=[False])
            .drop_duplicates(['Issue key'], keep='first')
            .loc[lambda x: x.Status == 'Open']





            share|improve this answer





























              0
















              you can sort_values() and then drop_duplicates():



              pd.concat([df1, df2]) 
              .sort_values(['UpdatedDate'], ascending=[False])
              .drop_duplicates(['Issue key'], keep='first')
              .loc[lambda x: x.Status == 'Open']





              share|improve this answer



























                0














                0










                0









                you can sort_values() and then drop_duplicates():



                pd.concat([df1, df2]) 
                .sort_values(['UpdatedDate'], ascending=[False])
                .drop_duplicates(['Issue key'], keep='first')
                .loc[lambda x: x.Status == 'Open']





                share|improve this answer













                you can sort_values() and then drop_duplicates():



                pd.concat([df1, df2]) 
                .sort_values(['UpdatedDate'], ascending=[False])
                .drop_duplicates(['Issue key'], keep='first')
                .loc[lambda x: x.Status == 'Open']






                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Mar 28 at 18:16









                jxcjxc

                2,9602 gold badges4 silver badges17 bronze badges




                2,9602 gold badges4 silver badges17 bronze badges

































                    draft saved

                    draft discarded















































                    Thanks for contributing an answer to Stack Overflow!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55403808%2fhow-to-count-tickets-with-status-change-filtered-byyear-to-date-in-pandas%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Kamusi Yaliyomo Aina za kamusi | Muundo wa kamusi | Faida za kamusi | Dhima ya picha katika kamusi | Marejeo | Tazama pia | Viungo vya nje | UrambazajiKuhusu kamusiGo-SwahiliWiki-KamusiKamusi ya Kiswahili na Kiingerezakuihariri na kuongeza habari

                    SQL error code 1064 with creating Laravel foreign keysForeign key constraints: When to use ON UPDATE and ON DELETEDropping column with foreign key Laravel error: General error: 1025 Error on renameLaravel SQL Can't create tableLaravel Migration foreign key errorLaravel php artisan migrate:refresh giving a syntax errorSQLSTATE[42S01]: Base table or view already exists or Base table or view already exists: 1050 Tableerror in migrating laravel file to xampp serverSyntax error or access violation: 1064:syntax to use near 'unsigned not null, modelName varchar(191) not null, title varchar(191) not nLaravel cannot create new table field in mysqlLaravel 5.7:Last migration creates table but is not registered in the migration table

                    은진 송씨 목차 역사 본관 분파 인물 조선 왕실과의 인척 관계 집성촌 항렬자 인구 같이 보기 각주 둘러보기 메뉴은진 송씨세종실록 149권, 지리지 충청도 공주목 은진현