Stream

 
We’ve all read about the shortage of data scientists from McKinsey, heard about the salaries, and know about the volume of recruiter emails. As a practicing data scientist at Pivotal (a leading vendor in open source, big data platforms specifically used for data science), I was recently interviewed on careers in data science. Because it has been a popular topic on Data Science Central, I wanted to share some of this perspective and see what other practitioners thought.
 
Has anyone seen or used Forecastthis?

I had an existing R script that cleaned and transformed the data, ran a random forest model with cross-validation and then predicted. It was around 40 lines of code. It took a bit of work to tune and get the accuracy up.

I used the same source data (with no cleaning) on Forecastthis. It cleaned the data, transformed it and cycled through models before recommending a type of decision tree ensemble, and the predictions that came back matched those from my R script.

The difference was that the grunt work and model review were handled for me.

One thing I didn't test is the option to download the R/Java/Python code for deployment purposes.

This is a big step forward and lets us remove any bias from the selection of models.

I'd love to see the resulting R code.

I don't think this will make data scientists redundant, but it will help transform one's workflow, IMO.
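For anyone who hasn't tried this kind of workflow, here is a rough sketch of the general idea of cycling through candidate models with cross-validation and keeping the best scorer. It is not Forecastthis's method and not my R script: the toy data, the two candidate models (a majority-class baseline and a one-feature decision stump) and every function name below are assumptions made up for illustration, using only the Python standard library.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and deal them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def accuracy(model, X, y):
    """Fraction of rows where the model's prediction equals the label."""
    return sum(1 for row, label in zip(X, y) if model(row) == label) / len(y)

def fit_majority(X, y):
    """Baseline candidate: always predict the most common training label."""
    label = max(set(y), key=y.count)
    return lambda row: label

def fit_stump(X, y):
    """Decision stump candidate: one feature, one threshold, chosen by
    exhaustive search for the best training accuracy (binary labels 0/1)."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for hi in (0, 1):
                rule = lambda row, j=j, t=t, hi=hi: hi if row[j] > t else 1 - hi
                acc = accuracy(rule, X, y)
                if best is None or acc > best[0]:
                    best = (acc, rule)
    return best[1]

def cv_accuracy(fit, X, y, k=5):
    """Mean held-out accuracy of a fitting function over k folds."""
    scores = []
    for fold in k_fold_indices(len(X), k):
        held = set(fold)
        train = [i for i in range(len(X)) if i not in held]
        model = fit([X[i] for i in train], [y[i] for i in train])
        scores.append(accuracy(model, [X[i] for i in fold], [y[i] for i in fold]))
    return sum(scores) / len(scores)

def select_model(candidates, X, y, k=5):
    """Score every candidate by cross-validation and return the best name."""
    scores = {name: cv_accuracy(fit, X, y, k) for name, fit in candidates.items()}
    return max(scores, key=scores.get), scores

# Made-up toy data: feature 0 determines the label, feature 1 is noise.
rng = random.Random(1)
X = [[rng.random(), rng.random()] for _ in range(100)]
y = [1 if row[0] > 0.5 else 0 for row in X]

best, scores = select_model({"majority": fit_majority, "stump": fit_stump}, X, y)
print(best, scores)
```

A real service would presumably search a much larger family of models and also handle cleaning and tuning. The point of the sketch is only that scoring every candidate on held-out folds takes the analyst's personal preference out of the model choice.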









 
NOAA’s Big Data Project Comes Into Focus

From the post: "What’s different about this newly announced NOAA program is not just the potential “big data” scope of the program but the way in which private sector cloud vendors are involved as intermediaries not only to the public but also to potential data vendors and resellers."
 
In this post, we’re going to break down 14 fantastic examples of complex data visualized. Below are two examples (click on picture to see better resolution):
 
Big data has a big problem. You know that data lake you recently started filling? Its data ecosystem is already well on its way to a catastrophic tipping point. Has a platform vendor convinced you to dive in with visions of a serene data-lake-side cabin in the woods?
 
Summer courses in London #bigdata #ehealth
ICT Training by Imperial College London and EIT ICT Labs
 
Some companies really get big data. Not only do they realise size matters – they understand you also have to know what to do with it. Here’s a list of seven companies I think are at the top of the game, when it comes to cutting-edge use of data to strategically achieve business goals. If you run a business yourself and are interested in big data projects, there is something to be learned from every one of these. So in no particular order …
 
400 Professional Resources - Covering data science, R, machine learning, data mining, deep learning, Python, NoSQL, statistics, and more. Includes code, data sets, recipes, tutorials, and case studies. Click here to check out this gold mine of information.

Gregory Mooney

Discussion  - 
 
A recap of the New England IoT Meetup in Boston last week on World IoT Day.
Industry experts and over 75 guests debated the value around IoT protocols, security, and theory in the upcoming IoT economy.
 
6 #IoT Startups That Make Connecting Things To The #Cloud A Breeze 
 
MapR declines Open Data Platform invitation, trades barbs with Hortonworks over its relevance to Hadoop and relationship with the ASF
 
#Cloudcomputing spending to hit $32bn in 2015 http://snip.ly/MjF2

Isaac Sacolick

Discussion  - 
 
What tools are you using to integrate data from SaaS platforms?
Top 100 Social CIO - I cover topics for technologists, CIO, CMO, and software developers. Agile, PMO, DevOps, Leadership, Business Intelligence, Big Data, Startups, Digital Marketing, Social networking, SaaS, content management, media, enterprise 2.0 and business transformation.

J. Nolfo

Discussion  - 
 
Data scientists are increasingly in demand. They also have a high average salary.
Strong math skills are a key to landing some of the best U.S. job opportunities -- particularly data scientist positions, according to a new report.
 
Many data set resources have been published on DSC, both big and little data. Some are associated with our data science apprenticeship. A list can be found here. Below is a repository published on GitHub, originally posted here.

Raja Balaji

Discussion  - 
 
What is one good book I shouldn't miss for learning about data? Thanks for your recommendations!
SAS Software:
Are you looking for general information or is there a specific topic we can help you with? http://2.sas.com/6268fX16
 
Starred articles were potential candidates for our picture of the week published in our weekly digest. Enjoy our new selection of articles and resources (R, data science, Python, machine learning, etc.). Comments are from Vincent Granville. For a full list of all resources featured so far, click here.

Lee Hawthorn

Discussion  - 
 
Parallel data modelling with CUDA
My first experience of accelerating analytics using massively parallel High Performance Computing (HPC) hardware began in 1997 with my secondment to the Esprit-funded HyperBank (High Performance Banking) project led by a consortium of banks and universities across Europe. A primary aim of the project was to use HPC to accelerate real-time analysis in areas such as credit risk to provide faster decisions to the customer in a highly competitive mar...
 
 
Data Will Get Bigger As Homes Become Smarter
Back in 2011, I wrote about data shedding its geeky persona and emerging as a valued component in every marketing agency. Since then, data has become a fully-fledged member of the marketing mix, informing campaigns from the ground up with crucial customer insights. Data drives targeted creative output and with it, return on investment.

Technology moves fast and we’re now seeing everyday objects becoming interactive platforms for branded communications.  

The household appliance

Come on - who really thought Amazon Dash was an April Fools’ joke? I did - why would I need a whole load of buttons around my house?

Well, it seems the retail giant is trialling it in the US and is now extending the concept by offering manufacturers of internet-enabled household appliances, such as coffee machines and printers, the ability to embed functionality within the devices to automatically reorder goods when supplies run low. All we need now is for the Amazon drone to be synced up with the home-bot so that it can receive the parcel, unpack it and install it. That said, if the home-bot gets it wrong we could all be drinking printer ink instead of a morning coffee! Wouldn’t that be a nice treat?
Read the full article here - http://goo.gl/45U6ji
Posted by Dave Gurney of alchemetrics.co.uk
#smarthome   #data  
Martin Zeitler:
Not to mention the microwave emissions of those not-so-smart "smart meters". For all the convenience being sold, the radiation side effects are kept quiet by the industry.