Post has shared content
Defined by 3Vs that are velocity, volume, and variety of the data, big data sits in the separate row from the regular data. Though big data was the buzzword since last few years for data analysis, the new fuss about big data analytics is to build up…

Post has shared content
What are the differences between traditional or RDBMS and Hadoop database systems? Both traditional relational (RDBMS) and Hadoop database systems have similar functionalities in terms of collection, storage, processing, recovery, extraction and data…

Post has shared content
Why did we start on this path? It all starts with our customers’ hybrid data management strategy. The need to embrace the proliferation of data that is creating new opportunities for businesses to better understand their customers, their industry and…

Post has shared content
Apache Storm is a distributed real-time big data-processing system. Storm is designed to process vast amount of data in a fault-tolerant and horizontal scalable method. It is a streaming data framework that has the capability of highest ingestion rates.…
Apache Storm – Introduction
Apache Storm – Introduction
bigdatapath.wordpress.com

Post has shared content
In this article, we will understand the very basic question which the beginners in the field of Big Data have. That is What is the difference between Big Data and Apache Hadoop. 1. Introduction The difference between Big Data and Apache Hadoop is distinct…
Difference Between Bigdata and Hadoop
Difference Between Bigdata and Hadoop
bigdatapath.wordpress.com

Post has attachment
Apache Spark Interview Questions and Answer (100+)

Spark Interview Questions has a collection of more than 100+ questions with answers asked in the interview for freshers and experienced (Programming, Scenario-Based, Fundamentals, Performance Tunning based Question and Answer) so thought worth sharing.This App is intended to help Apache Spark Career Aspirants to prepare for the interview.

https://play.google.com/store/apps/details?id=com.navnath.software

Post has shared content
Hadoop greatly helps in storing and processing large data sets in a distributed computing environment. Today, the framework is largely adopted in IT solutions and hence the need for Hadoop experts who are trained in the field. Given below are some of the…

Post has shared content
Installing Java Syntax of java version command $ java -version Following output is presented. java version “1.7.0_71” Java(TM) SE Runtime Environment (build 1.7.0_71-b13) Java HotSpot(TM) Client VM (build 25.0-b02, mixed mode) Creating User Account System…
Hadoop Multi Node Clusters
Hadoop Multi Node Clusters
bigdatapath.wordpress.com

Post has shared content
Problem I need to export data from the Hadoop Distributed File System (HDFS) to a SQL Server database table. How can I do this? Solution Apache’s Sqoop allows for importing data from a database such as SQL Server to the HDFS, and for exporting data from…

Post has shared content
Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent…
Hadoop advantages and disadvantages
Hadoop advantages and disadvantages
bigdatapath.wordpress.com
Wait while more posts are being loaded