3 skills a data scientist needs
O’Reilly’s Strata 2011 Conference is coming soon, and O’Reilly has updated its site with a great interview with LinkedIn Senior Scientist Pete Skomoroch. In the short 4-minute video, he discusses the important skills every data scientist needs, ranging from statistics to knowledge of external APIs.
The first skill, as you might expect, is a base in statistics, algorithms, machine learning, and mathematics. “You need to have a solid grounding in those principles to actually extract signals from this data and build things with it,” Skomoroch said.
Second, a good data scientist is handy with a collection of open-source tools — Hadoop, Java, Python, among others. Knowing when to use those tools, and how to code, are prerequisites.
The third set of skills focuses on making products real and making data available to users. “That might mean data visualization, building web prototypes, using external APIs, and integrating with other services,” Skomoroch said. In other words, this one’s a combination of coding skills, an ability to see where data can add value, and collaboration with teams to make these products a reality.
Registration for StrataConf is still going strong, and using the special VizWorld conference registration code ‘str11vzw’ you can get 25% off, dropping the current $1895 price tag down to $1421!
Or, much to my disappointment again, you can use O’Reilly’s own ‘str11rad’ code for 30% off, which brings the $1895 price down to $1326.50.