Data Science Weekly - Issue 246
Issue #246 Aug 9 2018
Editor Picks
Rethinking Fast and Slow in Data Science: How to make data science experiments agile
The tension between long-term planning and short-term flexibility is everywhere, including data science methodology. Is it possible for product development teams to reconcile rapid iteration with the slow-moving behemoth of the deep research process, or must they pick one?...
Here's How America Uses Its Land
There are many statistical measures that show how productive the U.S. is. Its economy is the largest in the world and grew at a rate of 4.1 percent last quarter, its fastest pace since 2014. The unemployment rate is near the lowest mark in a half century. What can be harder to decipher is how Americans use their land to create wealth...
Self Driving Car Learns Online and On-board on Raspberry Pi 3
We have been hard at work to create (to our knowledge) the world’s first fully online learning self-driving mini-car!...
A Message from this week's Sponsor:
Mode Studio: SQL, Python, R, & charts in one platform
No more jumping between applications. Mode Studio is the analytics toolkit that brings everything together, and gets out of the way. Explore data in our SQL editor, and pass results to integrated Python or R notebooks for deeper exploration and visualization. You can also layer charts over results quickly with built-in visualization tools, and sharing is easy—just send the report URL to teammates when you're ready...
Data Science Articles & Videos
TensorFlow 1.9 Officially Supports the Raspberry Pi
Thanks to a collaboration with the Raspberry Pi Foundation, we’re now happy to say that the latest 1.9 release of TensorFlow can be installed from pre-built binaries using Python’s pip package system!...
The Coca-Cola Company using TensorFlow for digital marketing campaigns
In this episode of TensorFlow Meets, Laurence sits down with Patrick Brandt, who is part of the Marketing Operations team at Coca-Cola North America. Learn how TensorFlow was used to run one of Coca-Cola's biggest digital marketing campaigns, which involved image classification of Coca-Cola bottle caps using deep neural networks...
In search of decentralized data markets
Here are some introductory notes about smart contracts and related technology leading toward decentralized data markets...
Neural Style Transfer: Creating Art with Deep Learning using tf.keras and eager execution
In this tutorial, we will learn how to use deep learning to compose images in the style of another image (ever wish you could paint like Picasso or Van Gogh?)...
LiquidFun
I ported the LiquidFun wave machine to Observable (and latest three.js)...
Machine-Generated Knowledge Bases
Human-generated knowledge bases like Wikipedia have a recall problem. First, there are the articles that should be there but are entirely missing. The unknown unknowns...
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
We conduct a systematic evaluation of generic convolutional and recurrent architectures for sequence modeling. The models are evaluated across a broad range of standard tasks that are commonly used to benchmark recurrent networks. Our results indicate that a simple convolutional architecture outperforms canonical recurrent networks such as LSTMs across a diverse range of tasks and datasets, while demonstrating longer effective memory...
Urban Nation: The Rise of the American City
Explores the historic population of America's cities in depth and shows the transition to more urban and suburban areas over time. Of special interest is how the US Census Bureau made adjustments to the definition of "urban area" in the 1950 census to account for suburban growth...
Jobs
Data Scientist - Memorial Sloan Kettering Cancer Center - NYC
For the 28th year, MSK has been named a top hospital for cancer by U.S. News & World Report. We are proud to be on Becker’s Healthcare list as one of the 150 Great Places to Work in Healthcare in 2018, as well as one of Glassdoor’s Employees’ Choice Best Places to Work for 2018. We’re treating cancer, one patient at a time. Join us and make a difference every day.
We are seeking a Data Scientist / Senior Strategic Analyst (Ambulatory Care) who will serve as a champion for data-driven decision making within Ambulatory Care, particularly to optimize the use of our outpatient resources, improve the patient experience and ensure MSK’s healthy growth. The candidate will serve as an analytic thought partner to the VP and own the strategic roadmap for iteratively developing analytical products and supporting those needs...
Training & Resources
Stack A List of TensorFlow Tensors Into One Tensor
Learn how to stack a list of TensorFlow Tensors of the same rank into one tensor by using tf.stack, via a screencast video and full tutorial transcript...
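As a rough illustration of what the tutorial covers, here is a minimal sketch of tf.stack. The tensor values and variable names are made up for this example and are not taken from the screencast; it assumes TensorFlow is installed.

```python
import tensorflow as tf

# Three rank-1 tensors of the same shape (illustrative values only).
a = tf.constant([1, 2])
b = tf.constant([3, 4])
c = tf.constant([5, 6])

# tf.stack joins the tensors along a NEW axis, so rank-1 inputs
# produce a rank-2 result.
rows = tf.stack([a, b, c])          # new axis at position 0
cols = tf.stack([a, b, c], axis=1)  # new axis at position 1

# Static shapes are available in both graph and eager mode.
rows_shape = rows.shape.as_list()
cols_shape = cols.shape.as_list()
print(rows_shape, cols_shape)
```

With the default axis=0 each input becomes one row of the result; with axis=1 each input becomes one column.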
Why I Indent My Code 8 Spaces
I’ve been using 8 spaces for a long time now and I’ve found that it has a number of benefits...
AutoKeras: Automated machine learning (AutoML) package
Auto-Keras is an open source software library for automated machine learning (AutoML). The ultimate goal of AutoML is to make deep learning models easily accessible to domain experts with limited data science or machine learning background. Auto-Keras provides functions to automatically search for the architecture and hyperparameters of deep learning models...
Books
The Joy of x: A Guided Tour of Math, from One to Infinity
"Delightful . . . easily digestible chapters include plenty of helpful examples and illustrations. You'll never forget the Pythagorean theorem again!"...
For a detailed list of books covering Data Science, Machine Learning, AI and associated programming languages check out our resources page.
P.S. Want to reach our audience / fellow readers? Consider sponsoring - grab a spot now; first come, first served!
All the best,
Hannah & Sebastian