Stream

 
Understand the basic parts of #Hadoop Ecosystem. Why Hadoop is so much popular, the power of Hadoop and some basic knowledge of each Hadoop components. Read More - http://goo.gl/l5kuvc
Understand the basic parts of Hadoop Ecosystem. Why Hadoop is so much popular, the power of Hadoop and some basic knowledge of each Hadoop components.
1
Add a comment...

Sanoop S Nair

Discussion  - 
 
Hi, 

Any one can give an example (or any tutorials) for boto kinesis .
1
Add a comment...
 
Hi,

Any one do Mongodb Hadoop Streaming (using python and mongo-hadoop Connector) pls share tutorials 
1
Add a comment...

prashant pandey

Discussion  - 
 
Do you consider security while implementing big data solutions? A short post about what you need to consider
http://www.thriveschool.blogspot.in/2014/02/big-data-enterprise-security.html
1
Add a comment...
 
 
How Are Companies Organising Their Big Data Initiatives - Infographic
http://bit.ly/15slKun
1
Add a comment...
 
A highly efficient  data computation middleware, esProc, is released to be complement shortages of Map reduce, making Hadoop stronger on big data analytics. http://www.raqsoft.com/product-esproc
1
Add a comment...

murali m

Discussion  - 
 
Hi Guys,
I need to do analysis based on the keywords in comments, the database is mysql and at present the data is not complex or large but in future it will be.
Can you guys suggest will hadoop work for this and if yes where do I start from.
1
Syed Abdul Kather's profile photo
 
Try to use lucence api .. it has very good tokenzier and analyzer
Add a comment...

ROSNI K V

Discussion  - 
 
Hi,I'm newbie to hadoop environment,Do you have any idea about how to solve this error,or what may be the reason behind this error?
http://pastebin.com/sTczLzze I'm using hadoop 1.0.4,and wrote map reduce in python(hadoop streaming is used)
Give me some solutions for efficient compilation of map reduce and python.

Thank You!!!
1
ROSNI K V's profile photoMohammad Tariq's profile photo
2 comments
 
Sorry,that was by mistake and correct error is available in the link now,please check
Thank you for the response!
Add a comment...

Mohankumar T

Discussion  - 
 
Hi guys, am gonna work on data analytics with hadoop...pls suggest me where to start and what have to learn.. thanks in advance! cheers :)
1
Mohammad Tariq's profile photo
 
Start with studying about the platform itself and try to analyse where does it fit. Hadoop is not the solution to each and every problem. See if it fits into your requirement.
Add a comment...

Raja Balaji

Discussion  - 
 
Great Community. Despite of having a lot of theoritical information about Hadoo and Map reduce, I have not done a system by myself. Someone here can help me do a basic hadoop with VMs to establish a good understanding ?
Thanks.
1
unmesha sreeveni's profile photoMohammad Tariq's profile photo
 
Free online hive database
1
Add a comment...
 
Webinar: Data Visualization - How to unlock value in Data – 14th May, 2014, 1 PM EST
[RSVP: https://attendee.gotowebinar.com/register/8098898554395991297]
 
Conveying meaning in data quickly is the focal point of analytics. Visual analytics helps you discover new relationships in data, prompts you to ask new questions, and helps you convey what you see to others. Join us for this webinar to learn how to unlock the potential of your data using data visualizations.
 
Save the date! 14th May, 2014 (Wednesday), Time: 1 PM EST
 
How to join?
Register here to join the webinar: https://attendee.gotowebinar.com/register/8098898554395991297
For more details and upcoming webinars, stay tuned to our webinar hub page: http://www.perceptive-analytics.com/data-visualization-designer/#webinar

Who should join?
This free webinar is an excellent opportunity for businesses/corporate professionals that deal with huge data trying to discover new relationships and unlock the potential in data using data visualizations.
Webinar Objectives:
·         Understand how to make sense of vast data quickly
·         Elicit questions you did not ask before
·         Using visualizations to discover new data relationships
·         Learn how data visualization can help identify hidden insights in data
·         Explore various visualizations hand-picked by experts
 
About Speaker:  Chaitanya Sagar, CEO of Perceptive Analytics.
(http://in.linkedin.com/in/chaitanyasagar/)
Chaitanya Sagar is the founder and CEO of Perceptive Analytics. He is a Chartered Accountant (equivalent to CPA) and also holds a MBA from the Indian School of Business. He has a total experience of 15 years serving 300+ clients from medium to large companies in the USA, India, Australia, Europe and Middle East. He is an expert in creating Data Visualizations and has made presentations at international conferences.
 
About Perceptive Analytics: 
 
Perceptive Analytics is a Data Analytics company, offering specialized services in Data Visualization, Dashboard Design, Marketing Analytics, Web Marketing Analytics, Spreadsheet Modeling and Application Solutions. We have the reputation of being a trusted advisor with a penchant to deliver compelling value. We help clients unlock hidden insights using our cutting edge data visualizations. The clientele we serviced include a wide range of companies from listed companies to start ups in Silicon Valley to privately owned multi-billion dollar companies.
RSVP here: https://attendee.gotowebinar.com/register/8098898554395991297
1
Add a comment...
 
 
Difference between esProc and Hadoop on big data computing.

esProc has more flexible and simpler parallel mechanism, which can achieve light-weight parallel commuting based on Hadoop, the computing performance  is close or even better than traditional database under small or mediate clusters.

MapReduce is a computing framework for job scheduling and fault tolerance in cluster computing; MapReduce is not involved in computing and does not offer basic function library

esProc concerns with computing with rich function library; esProc does not offer framework, scheduling and fault tolerance can be done by developers via function library

MapReduce is a complex framework particularly for large clusters, and the data exchange and fault tolerance method is inherent, which is difficult to tune in accordance with task features for better performance

esProc can flexibly control job scheduling and balance performance and fault tolerance as per the task features, thus light-weight parallel computing  can be realized in small and medium for higher performance
1
Add a comment...
 
Win Free E-book  Web Crawling and Data Mining with Apache Nutch

We have a few e-copies of Web Crawling and Data Mining with Apache Nutch ( http://www.attuneinfocom.com/contest/data-mining-with-apache-nutch.html )  available for review. 
1
Add a comment...
1
Add a comment...

Mustufa Rahi

Discussion  - 
 
For Hadoop Beginners.
Learn working with Hadoop from FREE ebook.
Download your FREE ecopy now.
1
1
Raja Balaji's profile photoBen Liu's profile photo
 
Thanks for the share.
Add a comment...

Mohammad Tariq
owner

Discussion  - 
 
Quite often while working with Pig you would have reached a situation wherein you found that your Pig scripts have reached a such a level of complexity that the flow of execution, and it’s relation to the MapReduce jobs being...
1
piyush pankaj's profile photo
 
nice post...
Add a comment...

hadoop pass

Discussion  - 
 


Hi,
Please find the link for requested PDF.

http://pappupass.com/Hadoop_Interview_question.pdf

Online version link
http://www.pappupass.com/class/index.php/hadoop/hadoop-interview-questions

Hadoop certification simulator link
www.pappupass.com

We had  launched the Hadoop Administrator Certification Simulator as well with 238 practice questions.

And our current Hadoop Developer Exam Simulator is already being used by more than 250 users and most of them already cleared the exam.

These are based on CCD-410 and CCA-410 latest syllabous.

Thanks
PappuPass Learning Resources
1
Add a comment...