How to count tickets with status change filtered byYear-To-Date in Pandas?How do you change the size of figures drawn with matplotlib?How can I count the occurrences of a list item?How to change the order of DataFrame columns?How to drop rows of Pandas DataFrame whose value in a certain column is NaNChange data type of columns in PandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?How to deal with SettingWithCopyWarning in Pandas?How to count the NaN values in a column in pandas DataFrameHow to check if any value is NaN in a Pandas DataFrame
Proper way to shut down consumer
Is the mass of paint relevant in rocket design?
Could Apollo astronauts see city lights from the moon?
Is it impolite to ask for an in-flight catalogue with no intention of buying?
How to create fractional SI units (SI...sqrts)?
How to justify a team increase when the team is doing good?
Does wetting a beer glass change the foam characteristics?
How do you use the interjection for snorting?
What exactly did this mechanic sabotage on the American Airlines 737, and how dangerous was it?
what should be done first, handling missing data or dealing with data types?
How can this Stack Exchange site have an animated favicon?
Under what circumstances would RAM locations 0 and 1 be written and/or read on the C64?
How to clarify between imagined sensations and "real" fantasy events?
Designing a time thief proof safe
How to see the previous "Accessed" date in Windows
Does the Prepare Food ability from Cook's Utensils stack?
Do I have advantage with Riposte when moving away from a flanked enemy and triggering an opportunity attack?
Hangman Game (YAHG)
A famous scholar sent me an unpublished draft of hers. Then she died. I think her work should be published. What should I do?
Received a package but didn't order it
What is the meaning of word 'crack' in chapter 33 of A Game of Thrones?
Why did the Soviet Union not "grant" Inner Mongolia to Mongolia after World War Two?
Tesla coil and Tesla tower
How to say "cheat sheet" in French
How to count tickets with status change filtered byYear-To-Date in Pandas?
How do you change the size of figures drawn with matplotlib?How can I count the occurrences of a list item?How to change the order of DataFrame columns?How to drop rows of Pandas DataFrame whose value in a certain column is NaNChange data type of columns in PandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?How to deal with SettingWithCopyWarning in Pandas?How to count the NaN values in a column in pandas DataFrameHow to check if any value is NaN in a Pandas DataFrame
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;
I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).
But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.
# this df1 (Snapshot 1)
Issue key Project name Status UpdatedDate
111 Proj1 Analysis 2019-03-18
222 Proj1 Open 2019-03-18
# this df2 (Snapshot 2)
Issue key Project name Status UpdatedDate
111 Proj1 Done 2019-03-28
222 Proj1 Open 2019-03-28
So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.
How can I count number of ticket that are still open but without duplicates?
python pandas dataframe
add a comment
|
I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).
But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.
# this df1 (Snapshot 1)
Issue key Project name Status UpdatedDate
111 Proj1 Analysis 2019-03-18
222 Proj1 Open 2019-03-18
# this df2 (Snapshot 2)
Issue key Project name Status UpdatedDate
111 Proj1 Done 2019-03-28
222 Proj1 Open 2019-03-28
So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.
How can I count number of ticket that are still open but without duplicates?
python pandas dataframe
add a comment
|
I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).
But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.
# this df1 (Snapshot 1)
Issue key Project name Status UpdatedDate
111 Proj1 Analysis 2019-03-18
222 Proj1 Open 2019-03-18
# this df2 (Snapshot 2)
Issue key Project name Status UpdatedDate
111 Proj1 Done 2019-03-28
222 Proj1 Open 2019-03-28
So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.
How can I count number of ticket that are still open but without duplicates?
python pandas dataframe
I have 2 snapshots of data set stored in 2 dataframes that contains JIRA tickets, there is a column called UpdatedDate which tells me when the snapshot was taken.
I want to calculate number of tickets which still remain open filter by Year-to-Date which basically means: how many tickets in total (combined snapshots) are still open util tomorrow (eg.2019-03-29).
But the problem is the both of my dataframes can contain the same JIRA issue, but the status of the ticket might or might not change.
# this df1 (Snapshot 1)
Issue key Project name Status UpdatedDate
111 Proj1 Analysis 2019-03-18
222 Proj1 Open 2019-03-18
# this df2 (Snapshot 2)
Issue key Project name Status UpdatedDate
111 Proj1 Done 2019-03-28
222 Proj1 Open 2019-03-28
So as the table indicated above, issue111's status has changed to Done on snapshot 2 where as issue222's status is still Open.
So if my Year-to-Date filter is set on 2019-03-29. it will show me 2 ticket with Status Open, but one of them will be a duplication.
How can I count number of ticket that are still open but without duplicates?
python pandas dataframe
python pandas dataframe
asked Mar 28 at 17:38
bossangelobossangelo
273 bronze badges
273 bronze badges
add a comment
|
add a comment
|
1 Answer
1
active
oldest
votes
you can sort_values()
and then drop_duplicates()
:
pd.concat([df1, df2])
.sort_values(['UpdatedDate'], ascending=[False])
.drop_duplicates(['Issue key'], keep='first')
.loc[lambda x: x.Status == 'Open']
add a comment
|
Your Answer
StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55403808%2fhow-to-count-tickets-with-status-change-filtered-byyear-to-date-in-pandas%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
you can sort_values()
and then drop_duplicates()
:
pd.concat([df1, df2])
.sort_values(['UpdatedDate'], ascending=[False])
.drop_duplicates(['Issue key'], keep='first')
.loc[lambda x: x.Status == 'Open']
add a comment
|
you can sort_values()
and then drop_duplicates()
:
pd.concat([df1, df2])
.sort_values(['UpdatedDate'], ascending=[False])
.drop_duplicates(['Issue key'], keep='first')
.loc[lambda x: x.Status == 'Open']
add a comment
|
you can sort_values()
and then drop_duplicates()
:
pd.concat([df1, df2])
.sort_values(['UpdatedDate'], ascending=[False])
.drop_duplicates(['Issue key'], keep='first')
.loc[lambda x: x.Status == 'Open']
you can sort_values()
and then drop_duplicates()
:
pd.concat([df1, df2])
.sort_values(['UpdatedDate'], ascending=[False])
.drop_duplicates(['Issue key'], keep='first')
.loc[lambda x: x.Status == 'Open']
answered Mar 28 at 18:16
jxcjxc
2,9602 gold badges4 silver badges17 bronze badges
2,9602 gold badges4 silver badges17 bronze badges
add a comment
|
add a comment
|
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55403808%2fhow-to-count-tickets-with-status-change-filtered-byyear-to-date-in-pandas%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown