How to count tickets with status change filtered byYear-To-Date in Pandas?How do you change the size of figures drawn with matplotlib?How can I count the occurrences of a list item?How to change the order of DataFrame columns?How to drop rows of Pandas DataFrame whose value in a certain column is NaNChange data type of columns in PandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?How to deal with SettingWithCopyWarning in Pandas?How to count the NaN values in a column in pandas DataFrameHow to check if any value is NaN in a Pandas DataFrame

Proper way to shut down consumer

Is the mass of paint relevant in rocket design?

Could Apollo astronauts see city lights from the moon?

Is it impolite to ask for an in-flight catalogue with no intention of buying?

How to create fractional SI units (SI...sqrts)?

How to justify a team increase when the team is doing good?

Does wetting a beer glass change the foam characteristics?

How do you use the interjection for snorting?

What exactly did this mechanic sabotage on the American Airlines 737, and how dangerous was it?

what should be done first, handling missing data or dealing with data types?

How can this Stack Exchange site have an animated favicon?

Under what circumstances would RAM locations 0 and 1 be written and/or read on the C64?

How to clarify between imagined sensations and "real" fantasy events?

Designing a time thief proof safe

How to see the previous "Accessed" date in Windows

Does the Prepare Food ability from Cook's Utensils stack?

Do I have advantage with Riposte when moving away from a flanked enemy and triggering an opportunity attack?

Hangman Game (YAHG)

A famous scholar sent me an unpublished draft of hers. Then she died. I think her work should be published. What should I do?

Received a package but didn't order it

What is the meaning of word 'crack' in chapter 33 of A Game of Thrones?

Why did the Soviet Union not "grant" Inner Mongolia to Mongolia after World War Two?

Tesla coil and Tesla tower

How to say "cheat sheet" in French



How to count tickets with status change filtered byYear-To-Date in Pandas?


How do you change the size of figures drawn with matplotlib?How can I count the occurrences of a list item?How to change the order of DataFrame columns?How to drop rows of Pandas DataFrame whose value in a certain column is NaNChange data type of columns in PandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?How to deal with SettingWithCopyWarning in Pandas?How to count the NaN values in a column in pandas DataFrameHow to check if any value is NaN in a Pandas DataFrame






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;








0















I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



# this df1 (Snapshot 1)
Issue key Project name Status UpdatedDate
111 Proj1 Analysis 2019-03-18
222 Proj1 Open 2019-03-18

# this df2 (Snapshot 2)
Issue key Project name Status UpdatedDate
111 Proj1 Done 2019-03-28
222 Proj1 Open 2019-03-28


So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



How can I count number of ticket that are still open but without duplicates?










share|improve this question






























    0















    I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
    I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



    But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



    # this df1 (Snapshot 1)
    Issue key Project name Status UpdatedDate
    111 Proj1 Analysis 2019-03-18
    222 Proj1 Open 2019-03-18

    # this df2 (Snapshot 2)
    Issue key Project name Status UpdatedDate
    111 Proj1 Done 2019-03-28
    222 Proj1 Open 2019-03-28


    So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
    So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



    How can I count number of ticket that are still open but without duplicates?










    share|improve this question


























      0












      0








      0








      I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
      I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



      But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



      # this df1 (Snapshot 1)
      Issue key Project name Status UpdatedDate
      111 Proj1 Analysis 2019-03-18
      222 Proj1 Open 2019-03-18

      # this df2 (Snapshot 2)
      Issue key Project name Status UpdatedDate
      111 Proj1 Done 2019-03-28
      222 Proj1 Open 2019-03-28


      So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
      So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



      How can I count number of ticket that are still open but without duplicates?










      share|improve this question














      I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
      I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).



      But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.



      # this df1 (Snapshot 1)
      Issue key Project name Status UpdatedDate
      111 Proj1 Analysis 2019-03-18
      222 Proj1 Open 2019-03-18

      # this df2 (Snapshot 2)
      Issue key Project name Status UpdatedDate
      111 Proj1 Done 2019-03-28
      222 Proj1 Open 2019-03-28


      So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
      So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.



      How can I count number of ticket that are still open but without duplicates?







      python pandas dataframe






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Mar 28 at 17:38









      bossangelobossangelo

      273 bronze badges




      273 bronze badges

























          1 Answer
          1






          active

          oldest

          votes


















          0
















          you can sort_values() and then drop_duplicates():



          pd.concat([df1, df2]) 
          .sort_values(['UpdatedDate'], ascending=[False])
          .drop_duplicates(['Issue key'], keep='first')
          .loc[lambda x: x.Status == 'Open']





          share|improve this answer



























            Your Answer






            StackExchange.ifUsing("editor", function ()
            StackExchange.using("externalEditor", function ()
            StackExchange.using("snippets", function ()
            StackExchange.snippets.init();
            );
            );
            , "code-snippets");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "1"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );














            draft saved

            draft discarded
















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55403808%2fhow-to-count-tickets-with-status-change-filtered-byyear-to-date-in-pandas%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            0
















            you can sort_values() and then drop_duplicates():



            pd.concat([df1, df2]) 
            .sort_values(['UpdatedDate'], ascending=[False])
            .drop_duplicates(['Issue key'], keep='first')
            .loc[lambda x: x.Status == 'Open']





            share|improve this answer





























              0
















              you can sort_values() and then drop_duplicates():



              pd.concat([df1, df2]) 
              .sort_values(['UpdatedDate'], ascending=[False])
              .drop_duplicates(['Issue key'], keep='first')
              .loc[lambda x: x.Status == 'Open']





              share|improve this answer



























                0














                0










                0









                you can sort_values() and then drop_duplicates():



                pd.concat([df1, df2]) 
                .sort_values(['UpdatedDate'], ascending=[False])
                .drop_duplicates(['Issue key'], keep='first')
                .loc[lambda x: x.Status == 'Open']





                share|improve this answer













                you can sort_values() and then drop_duplicates():



                pd.concat([df1, df2]) 
                .sort_values(['UpdatedDate'], ascending=[False])
                .drop_duplicates(['Issue key'], keep='first')
                .loc[lambda x: x.Status == 'Open']






                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Mar 28 at 18:16









                jxcjxc

                2,9602 gold badges4 silver badges17 bronze badges




                2,9602 gold badges4 silver badges17 bronze badges

































                    draft saved

                    draft discarded















































                    Thanks for contributing an answer to Stack Overflow!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55403808%2fhow-to-count-tickets-with-status-change-filtered-byyear-to-date-in-pandas%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Kamusi Yaliyomo Aina za kamusi | Muundo wa kamusi | Faida za kamusi | Dhima ya picha katika kamusi | Marejeo | Tazama pia | Viungo vya nje | UrambazajiKuhusu kamusiGo-SwahiliWiki-KamusiKamusi ya Kiswahili na Kiingerezakuihariri na kuongeza habari

                    Swift 4 - func physicsWorld not invoked on collision? The Next CEO of Stack OverflowHow to call Objective-C code from Swift#ifdef replacement in the Swift language@selector() in Swift?#pragma mark in Swift?Swift for loop: for index, element in array?dispatch_after - GCD in Swift?Swift Beta performance: sorting arraysSplit a String into an array in Swift?The use of Swift 3 @objc inference in Swift 4 mode is deprecated?How to optimize UITableViewCell, because my UITableView lags

                    Access current req object everywhere in Node.js ExpressWhy are global variables considered bad practice? (node.js)Using req & res across functionsHow do I get the path to the current script with Node.js?What is Node.js' Connect, Express and “middleware”?Node.js w/ express error handling in callbackHow to access the GET parameters after “?” in Express?Modify Node.js req object parametersAccess “app” variable inside of ExpressJS/ConnectJS middleware?Node.js Express app - request objectAngular Http Module considered middleware?Session variables in ExpressJSAdd properties to the req object in expressjs with Typescript