Data Science Weekly - Issue 124
Issue #124 April 7 2016
Editor Picks
Doing Data Science Right — Your Most Common Questions Answered
In this article, we've summarized the advice we give to founders who are interested in building data science teams. We explain why data science is so important for many startups, when companies should begin investing in it, where to put data science in their organization and how to build a culture where data science thrives...
Catching Star Wars surprises and other spoilers with Machine Learning
In this post she describes Fanguard, the tool she built at Insight to protect Tumblr readers from spoilers for blockbuster movies and popular TV shows...
Ask the Analyst: How Does Data Science Work at Twitch?
We just published an interview with Drew Harry, the Director of Science at Twitch. In it, Drew discusses how he and his team: Structure the Science team; Balance qualitative research with quantitative analysis; Define best practices for collaborating with other teams; and Promote the value of data science throughout the company...
A Message from this week's Sponsor:
Learn Functional Programming from Experts in 12 weeks, Tuition Free.
DataScience, Inc. is launching DS12, a residency program that will teach 12 qualified candidates functional programming skills and prepare them with the tools necessary to succeed as part of the world's leading data science teams. To learn more and to apply for our inaugural session beginning June 13th, click here
Data Science Articles & Videos
Deep Learning and the Future of AI
Prof. Yann LeCun (Director of AI Research at Facebook & Professor at NYU). Talk from CERN, Geneva, March 2016..
Taxi, Uber, and Lyft Usage in New York City
Open TLC data reveals the taxi industry’s contraction, Uber’s growth, and Lyft’s apparent struggle for traction...
Your Future Toyota May Know Where You're Going Before You've Told It
Toyota’s new subsidiary will manage the troves of data collected from its increasingly connected cars....
SORTING
A visualization of the most famous sorting algorithms ...
How to approach machine learning as a non-technical person
This post is not a primer on ML technology; this post won’t pretend to give you an explanation of deep learning or any specific technology, because these concepts change frequently and are largely irrelevant to much of the decision making. Instead, this post will address how to assess the technology and determine if it will yield pragmatic business value...
Guess The Correlation
How good are you at guessing correlation coefficients from scatter plots? Test your skills! ...
Deep3D: Automatic 2D-to-3D Video Conversion with CNNs
Wouldn't it be cool if 2D-to-3D conversion can be done automatically, if you can take a 3D selfie with an ordinary phone?...
Music Language Modeling with Recurrent Neural Networks
I trained a Long Short-Term Memory (LSTM) Recurrent Neural Network on a dataset of around 650 jigs and folk tunes. I sampled from this model to generate the following musical pieces...
When(ish) is My Bus? User-centered Visualizations of Uncertainty in Everyday, Mobile Predictive Systems
Users often rely on realtime predictions in everyday contexts like riding the bus, but may not grasp that such predictions are subject to uncertainty...
Jobs
Data Scientist - Colgate-Palmolive - Piscataway, New Jersey Colgate-Palmolive is a leading global consumer products company, tightly focused on Oral Care, Personal Care, Home Care and Pet Nutrition. This position is responsible for the design and development of prospective methods/tools that use data to predict clinical/consumer and finished product outcomes. These tools, infrastructure and methods will ensure that data is retained, accessible and fully leveraged by the organization to make better decisions...
Training & Resources
Markov Chains Through the Lens of Dynamical Systems
We start by introducing the notion of the expected motion of a stochastic process or a Markov chain...
Modern Pandas (Part 2): Method Chaining
This is part 2 in my series on writing modern idiomatic pandas...
Generating Training Data
Generating your own training data for machine learning with localturk...
Books
How to Bake Pi: An Edible Exploration of the Mathematics of Mathematics Accessible introduction to the logic of mathematics-sprinkled throughout with recipes...
"This is the best book about math that I've ever read. This coming from someone who had loved math from a young age, majored in math, and have taught math for several years..."
For a detailed list of books covering Data Science, Machine Learning, AI and associated programming languages check out our resources page.
P.S. Interested in reaching fellow readers of this newsletter? Consider sponsoring! Email us for details :) - All the best, Hannah & Sebastian