Questions

1
answer

How can I use the 'take(n)' function in Spark 3.4.0 with PySpark to display the top 3 rows of a CSV file?

27-May-23 19:14pm - updated 27-May-23 20:56pm
0
answers

Change value in nested struct, array, struct in a spark dataframe using pyspark

17-Jan-23 20:00pm
1
answer

How to write code using the Spark Scala API?

26-Aug-22 21:35pm - updated 29-Aug-22 17:11pm
0
answers

Iterate over PySpark array elements, and then within each element, using a loop

20-Apr-22 23:17pm - updated 21-Apr-22 5:43am
0
answers

There are match_id, batsman, and batsman_runs columns; the batsman_runs column holds the number of runs the batsman scored on each ball, such as 0, 1, 2

20-Jun-21 21:56pm
1
answer

How to write a Spark program to count letters

12-May-21 7:56am - updated 12-May-21 8:36am
0
answers

Spark is not able to connect to hive metastore

26-Apr-21 20:35pm
0
answers

PySpark is not working on my macOS

26-Apr-21 14:50pm
0
answers

Creating data frames columns from a list of dictionaries

16-Mar-21 2:07am
1
answer

Use value like ISO week but week starting from sunday

5-Dec-20 1:59am - updated 5-Dec-20 3:38am
1
answer

If I access data via spark, can I control database table access at column level with impala

2-May-19 21:46pm - updated 5-May-19 23:46pm
1
answer

Spark Scala: count even numbers from a file

16-Jun-18 20:59pm - updated 16-Jun-18 21:38pm
0
answers

How to merge two Spark rows

23-Apr-18 23:35pm
0
answers

Why do I get the wrong index when saving data in LIBSVM format using saveAsLibSVMFile

3-Apr-18 6:24am
0
answers

Executing commands on remote Spark (EC2) using the local R (SparkR) interface hangs

7-Jul-17 18:52pm - updated 9-Jul-17 17:45pm
0
answers

Splitting a dataset for cross-validation with FP-Growth in Spark

26-Mar-17 22:17pm
