Data Science Weekly - Issue 63
Issue #63 Feb 5 2015
Editor Picks
Visual Mapping of Twitch and Our Communities, ‘Cause Science!
Twitch has grown so quickly this year that it’s hard to keep track of all the amazing subgroups and communities that call Twitch home. To illustrate this, our Science team has recently been building visual maps of the Twitch world and we’re thrilled to share them with you!...
Shazam It! Music Processing, Fingerprinting, and Recognition
Shazam’s algorithm was revealed to world by it’s inventor Avery Li-Chung Wang in 2003. In this article we’ll go over the fundamentals of that algorithm...
Genetic Algorithm Walkers
What the hell is this? This observational pastime hopes to evolve walking creatures through genetic algorithms...
Data Science Articles & Videos
Surviving Data Science at the "Speed of Hype"
There is this idea endemic to the marketing of data science that big data analysis can happen quickly, supporting an innovative and rapidly changing company. But in my experience and in the experience of many of the analysts I know, this marketing idea bears little resemblance to reality...
The Man Who Knows Whether Any Startup Will Live or Die
Thomas Thurston thinks data science could remove a fair amount of the risk [of starting a business]. For the past nine years, he’s been honing techniques for evaluating business plans statistically rather than intuitively...
Quantity Versus Quality In Your Data Science Portfolio
A prospective employer will look at your online profiles. This will help them get a bigger picture of you than what was in your resume and profile you filled out for your data science job application. When they find your data science portfolio they will judge your work. Which brings up a very important question - should you focus on quality or quantity?...
Data Scientists Can Link Your Instagrams To Your Credit Card Purchases
When I tweeted from a Knicks game at Madison Square Garden on Dec. 2, I had no idea that data scientists could use that information to find out I’d used my MasterCard to buy an overpriced $12 beer — as well as identify all my other credit card purchases...
A Hierarchical Bayesian Drive-Survival Model of the NFL
In this post, I describe a model of the football drive as a piecewise exponential competing risks survival model. I then fit an example implementation, embedding the drive model within a Hierarchical Bayesian model of the NFL...
Stanford-Bred Startup Uses Moneyball Stats To Handicap Judges, Lawyers
If you’re being sued for patent infringement before U.S. District Judge Lucy Koh in the heart of California’s Silicon Valley, there’s something you ought to know...
The idea maze for AI startups
An “idea maze” is a map of all the key decisions and tradeoffs that startups in a given space need to make. I [Chris Dixon] thought it would be interesting to show an example of an idea maze for an area that I’m interested in: AI startups. Here’s a sketch of the maze. I explain each step in detail below...
Hacking Academia: Data Science and the University
A reflection on our SciFoo breakout session, where we discussed issues of data science within academia...
Data Science interviews: What can you expect?
Regularly, I receive questions from candidates interested in doing data science for consumer Internet companies asking me: “What do I need to get hired?”...
Jobs
Data Scientist - Contently - NYC Contently is on a mission to change the way content marketing is traditionally done. That means not only building a powerful technology but also leveraging a powerful network of creative experts that traditional marketers need to stay fresh and original. We are in search of a Data Scientist with a passion for discovering insight through data inference and exploration. Someone who geeks out over data and a strong understanding of statistics and machine learning combined with experience processing and generating large data sets. Working at Contently means that you will be collaborating with extremely intelligent, creative, and diverse problem-solvers who love a good story and many laughs...
Training & Resources
Notebook Gallery
Links to the best IPython and Jupyter Notebooks...
How to Create and Publish R package on CRAN
Step by step guide...
ReIntroducing Into: Clean data migration (with graphs!)
Tool to efficiently migrate data between formats...
Books
Street-Fighting Mathematics:
The Art of Educated Guessing and Opportunistic Problem Solving Not a new book, though very well reviewed...
"This book is a treasure trove of intuitive, practical, and brilliant mathematical techniques. Every person with an interest in mathematics, science, or engineering will enjoy this highly stimulating and fun book..."
For a detailed list of books covering Data Science, Machine Learning, AI and associated programming languages check out our resources page.
P.S. Enjoyed the newsletter? Please forward it along to friends and colleagues - we'd love to have them onboard! - All the best, Hannah & Sebastian