Raghvendra Sharma
71 followers
Raghvendra's posts

Apache Spark Interview Questions
1. What is Spark? Apache Spark is a fast, easy-to-use and flexible data processing framework. It has an advanced execution engine supporting cyclic data flow and in-memory computing. Spark can run on Hadoop, standalone or in the cloud and is capable of acc...

Handling data with changing schema on Hadoop
User data is often unpredictable. Even when we can predict a change coming our way, we need to prepare for it: make changes in our environment to accept, accommodate, and absorb the incoming change. With that being a fact of life, our design should allow f...

Hadoop - Small Files vs Big Files
Credits: https://blogs.msdn.microsoft.com/cindygross/2015/05/04/hadoop-likes-big-files/
One of the frequently overlooked yet essential best practices for Hadoop is to prefer fewer, bigger files over more, smaller files. How small is too small and how many i...

Eclipse - installing Scala plugin manually?
I have been playing around with Scala for some time and was always using the Scala IDE (www.scala-ide.org), which is based on a relatively old version of Eclipse (Luna). I recently discovered this, wherein you could install the Scala plug-in on a regular ...

Links to free big-data-sets
Many people who are starting their journey with big data and analytics find it hard to get their hands on the right kind of data to play or experiment with. Most of the time, people have the enthusiasm and are learning the skill too, but they just don't hav...

Partitioning in Informatica
Sourced from the Internet.
All transformations have some basic counters that indicate the number of input rows, output rows, and error rows. Source Qualifier, Normalizer, and target transformations have additional counters that indicate the efficiency of data mo...

Teradata Data type abbreviation - described
Teradata data types (as reported in DBC.Columns.ColumnType) can be cryptic and not always easy to remember. Here's a ready reckoner:
Abbreviation  Equivalent English :)
A1  ARRAY
AN  MULTI-DIMENSIONAL ARRAY
AT  TIME
BF  BYTE
BO  BLOB
BV  VARBYTE
CF  ...

Hackathon - Fintechathon
A hackathon is a hacking marathon wherein many people are invited to attack problems around a theme. I recently attended a hackathon over the Valentine's weekend. Organised by StartupBootCamp Fintech, it was attended by about 100 people. Many ideas, many ...