aggregating multiple columns in data.table

r dataframe aggregate data.table

10,835

this is actually what i was looking for and is mentioned in the FAQ:

dtb[,lapply(.SD,mean),by="id"]

10,835

Related videos on Youtube

Author by

Alex

Updated on June 07, 2022

Comments

Alex almost 2 years
I have the following sample data.table:
```
dtb <- data.table(a=sample(1:100,100), b=sample(1:100,100), id=rep(1:10,10))
```
I would like to aggregate all columns (a and b, though they should be kept separate) by id using colSums, for example. What is the correct way to do this? The following does not work:
```
 dtb[,colSums, by="id"]
```
This is just a sample and my table has many columns so I want to avoid specifying all of them in the function name
Christoph_J almost 12 years

+1 These, you are completely right, this is definitely the better way. I always think that I should have everything in long format, but quite often, as in this case, doing the computations is more efficient. You should mark yours as the correct answer.
Matt Dowle over 11 years

+1 Btw, this syntax has been optimized in the latest v1.8.2. There used to be a speed penalty of using lapply in j like that, but no longer. Revisions to data.table wiki points 1 have been made, and to point 5 are needed.

Recents

Why Is PNG file with Drop Shadow in Flutter Web App Grainy?

How to troubleshoot crashes detected by Google Play Store for Flutter app

Cupertino DateTime picker interfering with scroll behaviour

Why does awk -F work for most letters, but not for the letter "t"?

Flutter change focus color and icon color but not works

How to print and connect to printer using flutter desktop via usb?

Critical issues have been reported with the following SDK versions: com.google.android.gms:play-services-safetynet:17.0.0

Flutter Dart - get localized country name from country code

navigatorState is null when using pushNamed Navigation onGenerateRoutes of GetMaterialPage

Android Sdk manager not found- Flutter doctor error

Flutter Laravel Push Notification without using any third party like(firebase,onesignal..etc)

How to change the color of ElevatedButton when entering text in TextField

Related

R data.table: How to sum variables by group based on a condition?

Use data.table to count and aggregate / summarize a column

Aggregate / summarize multiple variables per group (e.g. sum, mean)

Loop over a data.table rows with condition

Aggregate15 minute data to hourly

Aggregating table() over multiple columns in R without a "by" breakdown

How to aggregate data in data.frame in R

Apply Encoding to Entire Data.Table

Remove columns of dataframe based on conditions in R

Subset by column index in R - Data.Table vs. dataframe