Data Science Weekly - Issue 254
Issue #254 Oct 4 2018
Editor Picks
Calculating the Age of the Universe
Using SQL to query the HyperLEDA database, and Axibase Time Series Database to store, process, and visualize the relevant information, this article demonstrates a relatively straightforward procedure for calculating a theoretical age of the universe...
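As a rough illustration of the kind of calculation involved, the sketch below assumes the procedure rests on Hubble's law (recession velocity = H0 x distance) and takes the age estimate to be the Hubble time, 1/H0. The function names and the three sample data points are hypothetical and chosen only so the arithmetic is easy to follow; they are not taken from HyperLEDA or from the article.

```python
# Minimal sketch, assuming Hubble's law v = H0 * d and age ~ 1 / H0.
# All names and sample values here are illustrative assumptions.

KM_PER_MPC = 3.0857e19        # kilometres in one megaparsec
SECONDS_PER_YEAR = 3.15576e7  # seconds in one Julian year


def fit_h0(distances_mpc, velocities_km_s):
    """Least-squares slope through the origin for v = H0 * d (km/s/Mpc)."""
    num = sum(d * v for d, v in zip(distances_mpc, velocities_km_s))
    den = sum(d * d for d in distances_mpc)
    return num / den


def hubble_time_gyr(h0_km_s_mpc):
    """Return 1/H0 in billions of years for H0 given in km/s/Mpc."""
    h0_per_second = h0_km_s_mpc / KM_PER_MPC  # convert H0 to units of 1/s
    return 1.0 / h0_per_second / SECONDS_PER_YEAR / 1e9


# Made-up galaxies: distances in Mpc and recession velocities in km/s.
distances = [10.0, 20.0, 30.0]
velocities = [700.0, 1400.0, 2100.0]

h0 = fit_h0(distances, velocities)
age = hubble_time_gyr(h0)
print(f"H0 = {h0:.1f} km/s/Mpc, Hubble time = {age:.2f} Gyr")
```

With real catalogue data the fitted slope, and therefore the estimated age, would depend heavily on which galaxies are selected and how their distances were measured.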
What Does it Take to Train Deep Learning Models On-Device?
Over the past few weeks, a few different people have asked me about the state of model training on phones and embedded devices. The good news is that it’s definitely possible; I know of multiple examples of teams doing this successfully. The bad news is that our tools don’t yet make it easy...
Data mining reveals the hidden laws of evolution behind classical music
Musicologists are beginning to uncover statistical patterns that govern how trends in musical composition have spread...
A Message from this week's Sponsor:
Find A Data Science Job Through Vettery
Vettery specializes in tech roles and is completely free for job seekers. Interested? Submit your profile, and if accepted onto the platform, you can receive interview requests directly from top companies growing their data science teams.
Get started.
Data Science Articles & Videos
RecSys 2018: Accepted Contributions
List of all long papers accepted for RecSys 2018 (in alphabetical order)...
How Big Data Is Changing Genetic Research
New biomedical techniques, like next-generation genome sequencing, are creating vast amounts of data and transforming the scientific landscape. They're leading to unimaginable breakthroughs — but leaving researchers racing to keep up...
Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 1
Efficient neural networks are key to leveraging DL in offline environments at low cost. Learn how to train ENet for MRI segmentation easily using Amazon SageMaker on AWS, and deploy to the edge with AWS Greengrass...
H3: Uber’s Hexagonal Hierarchical Spatial Index
Uber developed H3, our grid system for efficiently optimizing ride pricing and dispatch, for visualizing and exploring spatial data. H3 enables us to analyze geographic information to set dynamic prices and make other decisions on a city-wide level. We use H3 as the grid system for analysis and optimization throughout our marketplaces...
fastai v1 for PyTorch: Fast and accurate neural nets using modern best practices
Today fast.ai is releasing v1 of a new free open source library for deep learning, called fastai. The library sits on top of PyTorch v1 (released today in preview), and provides a single consistent API to the most important deep learning applications and data types...
Large Scale GAN Training for High Fidelity Natural Image Synthesis
We present large-scale training of generative models, which sets the new state of the art in image synthesis on ImageNet...
Query Understanding, Divided into Three Parts
My latest post on query understanding breaks it up into three parts: holistic (whole query), reductionist (segmentation / entity recognition), and resolution...
Digging into Data Science Tools: Docker
Provides a high-level overview of what Docker is, how it works, and why data scientists should learn about it...
Jobs
Data Scientist - Pear Therapeutics - San Francisco or Boston
At Pear Therapeutics, we have the privilege of building the world’s first-ever class of prescription digital therapeutics. By nature of our therapeutics as digital applications, we have access to rich datasets and unique opportunities to drive clinical outcomes. As a Data Scientist, you will be responsible for shaping and delivering data-driven insights. We are looking for data scientists with a deep product sense, who have an innate curiosity, and are eager to dive into large, complex datasets and create actionable insights...
Training & Resources
PyTorch Min: Get Minimum Value Of A PyTorch Tensor
Learn how to use PyTorch's min operation to calculate the min of a PyTorch tensor, via a screencast video and full tutorial transcript...
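For readers who want a quick feel for the operation before watching the screencast, here is a minimal sketch of `torch.min`, assuming PyTorch is installed. The tensor values are arbitrary and are not taken from the tutorial.

```python
import torch

# Arbitrary example tensor (2 rows, 3 columns).
t = torch.tensor([[3.0, 1.0, 4.0],
                  [1.5, 9.0, 2.0]])

# With no dim argument, torch.min returns the smallest element as a 0-dim tensor.
overall_min = torch.min(t)

# With a dim argument, it returns a (values, indices) pair along that dimension.
col_min = torch.min(t, dim=0)

print(overall_min.item())        # smallest value in the whole tensor
print(col_min.values.tolist())   # per-column minima
print(col_min.indices.tolist())  # row index of each per-column minimum
```

The reduction over a dimension is handy when the position of the minimum matters as well as its value.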
The Mechanics of Machine Learning
This book is a primer on machine learning for programmers trying to get up to speed quickly. You'll learn how machine learning works and how to apply it in practice...
TGM: PyTorch Geometry
We just open sourced the PyTorch Geometry package. A geometric computer vision library for PyTorch...
Books
Data Visualization with Python and JavaScript: Scrape, Clean, Explore & Transform Your Data
Learn how to turn raw data into rich, interactive web visualizations with the powerful combination of Python and JavaScript. With this hands-on guide, author Kyran Dale teaches you how to build a basic dataviz toolchain with best-of-breed Python and JavaScript libraries (including Scrapy, Matplotlib, Pandas, Flask, and D3) for crafting engaging, browser-based visualizations...
For a detailed list of books covering Data Science, Machine Learning, AI and associated programming languages check out our resources page.
P.S. Want to reach our audience / fellow readers? Consider sponsoring - grab a spot now; first come, first served!
All the best, Hannah & Sebastian