Profile cover photo
Profile photo
akhil pathirippilly
5 followers
5 followers
About
Posts

Post has attachment
Detailed Architecture of Spark with Cluster Manager as YARN in HDFS .
Big Data Analytics
Big Data Analytics
pathirippilly.blogspot.com
Add a comment...

Post has attachment
An attempt to simplify shuffling phase when groupByKey(),reduceByKey() and aggregateByKey() is performed.
Add a comment...

Post has attachment
A small attempt to describe the partition concept in spark
Add a comment...

Post has attachment
This is an attempt to explain map() vs mapPartitions()
Add a comment...

Post has attachment
Its a step by step guide to set up spark with python on ubuntu.
Add a comment...

Post has attachment
I myself went into some confusions while learning Data Frames and RDD of pyspark.After learning and collecting from different sources, I thought of putting it in a single page so that if it can help some one.
Add a comment...
Wait while more posts are being loaded