select multiple elements with group by in spark.sql

Yes, it's possible, and the error message you attached describes all the options. You can either add userId to the GROUP BY clause:

val sqlDF = spark.sql("SELECT count(customerId), customerId, userId FROM GETBYID GROUP BY customerId, userId")
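
For reference, the same grouping expressed with the DataFrame API (a sketch; it assumes df is the DataFrame behind the GETBYID view, as in the question):

import org.apache.spark.sql.functions.count

// Group by both columns so each is a valid SELECT target.
val byBoth = df.groupBy("customerId", "userId")
  .agg(count("customerId").as("cnt"))
byBoth.show()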

or use first():

val sqlDF = spark.sql("SELECT count(customerId), customerId, first(userId) FROM GETBYID GROUP BY customerId")
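
Here is a minimal end-to-end sketch putting both options together; the JSON path is a hypothetical placeholder, and the column names are taken from the question:

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
  .appName("GroupByExample")
  .master("local[*]")
  .getOrCreate()

val df = spark.read.json("/path/to/data.json") // hypothetical path
df.createOrReplaceTempView("GETBYID")

// Option 1: make userId part of the grouping key.
spark.sql("SELECT count(customerId), customerId, userId FROM GETBYID GROUP BY customerId, userId").show()

// Option 2: keep one arbitrary userId per customerId via first().
spark.sql("SELECT count(customerId), customerId, first(userId) FROM GETBYID GROUP BY customerId").show()

Note that first() returns an arbitrary value within each group, so use it only when any userId will do.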

Comments

  • rahul, almost 2 years ago

    Is there any way to do a GROUP BY in Spark SQL that selects multiple columns? The code I am using:

    val df = spark.read.json("//path")
    df.createOrReplaceTempView("GETBYID")
    

    Now I do a group by like:

    val sqlDF = spark.sql(
      "SELECT count(customerId) FROM GETBYID group by customerId");
    

    But when I try:

    val sqlDF = spark.sql(
      "SELECT count(customerId),customerId,userId FROM GETBYID group by customerId");
    

    Spark gives an error:

    org.apache.spark.sql.AnalysisException: expression 'getbyid.userId' is neither present in the group by, nor is it an aggregate function. Add to group by or wrap in first() (or first_value) if you don't care which value you get.;

    Is there any possible way to do that?