Data Science Weekly - Issue 155
Issue #155 Nov 10 2016
Editor Picks
Delivering real-time AI in the palm of your hand
As video becomes an even more popular way for people to communicate, we want to give everyone state-of-the art creative tools to help you express yourself. We recently began testing a new creative-effect camera in the Facebook app that helps people turn videos into works of art in the moment...
LipNet: How easy do you think lipreading is?
LipNet is doing lipreading using Machine Learning, aiming to help those who are hard of hearing and can revolutionise speech recognition...
Software Dreams Up New Molecules in Quest for Wonder Drugs
Ingesting a heap of drug data allows a machine-learning system to suggest alternatives humans hadn’t tried yet...
A Message from this week's Sponsor:
Data Science has come a long way, so has finding a job
You're on the cutting edge of technology, why job hunt like it's '90s? Try Hired today and have 4,000+ top companies apply for the chance to interview you. Try Hired today!
Data Science Articles & Videos
Colors of the Rails: Every color of every line of every metro system
When a map is created for a metro, subway, or raid transit system, colors are assigned to each line for identification. After creating the Global Subway Spectrum a few years ago which catalogs all these colors, I wanted to revisit the idea and create something new with updated data that transit fans like myself would enjoy hanging on their wall...
Kaggle competition Painter by Numbers
This repository contains a 1st place solution for the Kaggle competition Painter by Numbers. Below is a brief description of the dataset and approaches I've used to build and validate a predictive model...
A Day in the Life of a Data Engineer
I’m hoping to share a glimpse of what it means to me to be a data engineer at Yelp, with responsibilities over data streams covering vastly different applications in a day-in-the-life blog post...
Degrees of freedom in planning, running, analyzing, and reporting psychological studies A checklist to avoid p-hacking
In this review article, we present an extensive list of 34 degrees of freedom that researchers have in formulating hypotheses, and in designing, running, analyzing, and reporting of psychological research. The list can be used in research methods education, and as a checklist to assess the quality of preregistrations and to determine the potential for bias due to (arbitrary) choices in unregistered studies...
Faster R-CNN Pedestrian and Car Detection
My code for Faster R-CNN pedestrian-and-car-detection blog post...
DeepMind and Blizzard to release StarCraft II as an AI research environment
Today at BlizzCon 2016 in Anaheim, California, we announced our collaboration with Blizzard Entertainment to open up StarCraft II to AI and Machine Learning researchers around the world...
Adversarial Neural Cryptography in Theano
Last week I read Abadi and Andersen’s recent paper [1], Learning to Protect Communications with Adversarial Neural Cryptography. I thought the idea seemed pretty cool and that it wouldn’t be too tricky to implement, and would also serve as an ideal project to learn a bit more Theano. This post describes the paper, my implementation, and the results...
Image-Color
A deep learning approach to colorizing images. The current model was trained on screenshots taken from Pokemon Silver, and tested on Pokemon Blue Version...
Jobs
Data Visualization Developer - American Progress - Washington, D.C. American Progress is looking for a creative and innovative Data Visualization Developer who understands and appreciates both the art and science of translating numbers into graphics and interactives that are easy to understand and use. The Data Visualization Developer will create dynamic and static media to visualize demographic data, numbers, trends, ideas, and comparative data in order to create compelling stories and help American Progress communicate progressive ideas and arguments...
Training & Resources
An Introduction to Deep Learning
To understand what deep learning is, we first need to understand the relationship deep learning has with machine learning, neural networks, and artificial intelligence...
20 Weird & Wonderful Datasets for Machine Learning
Findings from my hunt for amazing datasets...
Apache Spark: A Unified Engine for Big Data Processing
Highly technical but good overview of Spark. My Baidu teams use it for many data processing tasks...
Books
What is a p-value anyway?
34 Stories to Help You Actually Understand Statistics "Offers a fun introduction to the fundamental principles of statistics, presenting the essential concepts in thirty-four brief, enjoyable stories"...
For a detailed list of books covering Data Science, Machine Learning, AI and associated programming languages check out our resources page.
P.S. Interested in reaching fellow readers of this newsletter? Consider sponsoring! Email us for details :) - All the best, Hannah & Sebastian