Click here to Skip to main content
15,889,877 members
Everything / Artificial Intelligence / Big Data

Big Data

big-data

Great Reads

by Ilia Reznik, Vladimir Shatalov
How to classify articles on Wikipedia using XML dump
by Sacha Barber
Looking at Spark/Cassandra working together
by Ryan Scott White
Converts past and real-time stock market tick data into time-sliced summaries called Briefs
by Joezer BH
Explains the benefits of using the command line in the large folder delete case and shows an example of the syntax

Latest Articles

by Fazlur Rahman
Step by step procedure to install Hadoop 2.7.3 version on Ubuntu 16.04 operating system
by Ryan Scott White
Converts past and real-time stock market tick data into time-sliced summaries called Briefs
by Ilia Reznik, Vladimir Shatalov
Parser for Wikipedia pages from XML dump is presented. Extraction of biographical data and categories with their parents is shown as an example.
by Ilia Reznik, Vladimir Shatalov
How to classify articles on Wikipedia using XML dump

All Articles

Sort by Updated

Big Data 

13 Jul 2022 by E L 2022
I need help plotting some categorical and numerical Values in python. the code is given below: %%time import pandas as pd import numpy as np import matplotlib.pyplot as plt import seaborn as sns %%time ...
22 May 2022 by Fazlur Rahman
Step by step procedure to install Hadoop 2.7.3 version on Ubuntu 16.04 operating system
25 Mar 2022 by Viswanath Sitaraman
I'm trying to convert a piece of SQL code to HiveQL, and it's not working as expected. Please find below the code snippet in SQL that I'm attempting to convert: SQL Code:UPDATE C SET C.prod_l = P.prod_l, C.numprod = P.numprod, C.prod_cng...
15 Jan 2022 by Member 15489508
i Have masterdatafile its about 4GiG And more than 45 million records. How can i load this data in MongoDB? What I have tried: I have tried nothing so far.i just downloaded MongoDb
15 Jan 2022 by OriginalGriff
Perhaps this will help: Import Data into MongoDB[^]
4 Oct 2021 by Richard MacCutchan
The title says it all: Apache Spark™ - Unified Analytics Engine for Big Data[^]
10 May 2021 by Ryan Scott White
Converts past and real-time stock market tick data into time-sliced summaries called Briefs
10 Apr 2021 by Ilia Reznik, Vladimir Shatalov
Parser for Wikipedia pages from XML dump is presented. Extraction of biographical data and categories with their parents is shown as an example.
9 Apr 2021 by Ilia Reznik, Vladimir Shatalov
How to classify articles on Wikipedia using XML dump
12 Dec 2020 by BedantBiswal
Below is my query which takes around 5k mappers and 1k reducers and time taken is around 2.2 hours to finish. Any scope of optimization in here? What I have tried: SELECT sum(B.item_net_amount) net_amount, sum(B.item_gross_amount) gross_amount,...