Grouping by with Where conditions in Pandas

python pandas where-clause pandas-groupby

17,515

It seems you need query or boolean indexing first for filtering:

pauses.query("pause_end > pause_start")
       .groupby(["subscription_id"])["dif_pause"].mean().reset_index(name="avg_pause")

pauses[pauses["pause_end"] > pauses["pause_start"]]
      .groupby(["subscription_id"])["dif_pause"].mean().reset_index(name="avg_pause")

17,515

Author by

Keithx

Updated on June 05, 2022

Comments

Keithx almost 2 years
Have a dataframe like this:

I created column 'dif_pause' based on subtracting 'pause_end' and 'pause_start' column values and doing the mean value aggregation using groupby () function just like this:
```
pauses['dif_pause'] = pauses['pause_end'] - pauses['pause_start']
pauses['dif_pause'].astype(dt.timedelta).map(lambda x: np.nan if pd.isnull(x) else x.days)

pauses_df=pauses.groupby(["subscription_id"])["dif_pause"].mean().reset_index(name="avg_pause")
```
I'd like to include in the groupby section the checking whether pause_end>pause_start (some equialent of WHERE clause in SQL). How can one do that?

Thanks.

Recents

Why Is PNG file with Drop Shadow in Flutter Web App Grainy?

How to troubleshoot crashes detected by Google Play Store for Flutter app

Cupertino DateTime picker interfering with scroll behaviour

Why does awk -F work for most letters, but not for the letter "t"?

Flutter change focus color and icon color but not works

How to print and connect to printer using flutter desktop via usb?

Critical issues have been reported with the following SDK versions: com.google.android.gms:play-services-safetynet:17.0.0

Flutter Dart - get localized country name from country code

navigatorState is null when using pushNamed Navigation onGenerateRoutes of GetMaterialPage

Android Sdk manager not found- Flutter doctor error

Flutter Laravel Push Notification without using any third party like(firebase,onesignal..etc)

How to change the color of ElevatedButton when entering text in TextField

Related

Pandas: how to get a particular group after groupby?

Python Pandas max value in a group as a new column

Pandas combine two group by's, filter and merge the groups(counts)

Pandas : Sum multiple columns and get results in multiple columns

Groupby column and find min and max of each group

Sumifs in Pandas with two conditions

pandas boxplots as subplots with individual y-axis

Pandas converting Rows to Columns

How to group dataframe by hour using timestamp with Pandas

Pandas split CSV into multiple CSV's (or DataFrames) by a column