Deep Learning Weekly Issue #125

Facebook's deepfake challenge, a new chip from Apple, TensorFlow.js for react-native, NeurIPS papers and more

Hey folks,

This week in deep learning, we bring you a deepfake detection challenge from Facebook, TensorFlow.js for react-native, an AI IoT development kit from Microsoft, and the list of papers accepted at NeurIPS 2019.

You may also enjoy a course on deep Bayesian methods, a new dataset of preference elicitation, an implementation of rotated Mask R-CNN, a repository with NLP models implemented in TensorFlow, a tutorial on attention-based OCR models, and more.

As always, happy reading and hacking. If you have something you think should be in next week's issue, find us on Twitter: @dl_weekly.

Until next week!


Facebook: Creating a data set and a challenge for deepfakes

Facebook launches a $10 million prize pool for research detecting deepfakes.

TensorFlow.js releases react-native plugin

There is now another way to run TensorFlow on mobile apps.

Microsoft’s Vision AI Developer Kit is now generally available [VentureBeat]

Train models using Azure Custom Vision service and run inference directly on an IoT camera with a Qualcomm Snapdragon processor.

When the A.I. Professor Leaves, Students Suffer, Study Says [New York Times]

When tech companies poach professors, grant funding to universities goes down and student-founded startups drop.

Apple's New A13 Bionic CPU to Power iPhone 11 with Longer Battery Life, Higher Speeds

Machine learning featured heavily in the iPhone Pro announcement, mostly related to computer vision.


NeurIPS 2019 Acceptances are out.

Based on titles alone, GANs, optimizers, and Bayesian approaches feature heavily.

Summer school on Deep Bayesian methods.

Videos, slides, and assignments for a course on deep bayesian networks.

SpeechBrain: A PyTorch-based Speech Toolkit.

A call for collaborators from an ambitious new project to create a speech toolkit in PyTorch.

Building Custom Deep Learning Based OCR models

A great tutorial on attention-based OCR models.


Google: Coached Conversational Preference Elicitation

Google announces a new dataset of conversations where an assistant elicits movie preferences from a user.

Libraries & Code

[GitHub] vietnguyen1991/AirGesture

Play games without touching they keyboard using a TensorFlow-based gesture recognizer.

[GitHub] mrlooi/rotated_maskrcnn

An implementation of Mask R-CNN with improved performance on rotated objects.

[GitHub] huseinzol05/NLP-Models-Tensorflow

An extensive list of NLP models implemented in TensorFlow.

Papers & Publications

Once for All: Train One Network and Specialize it for Efficient Deployment

Abstract: ….In this work, we introduce Once for All (OFA) for efficient neural network design to handle many deployment scenarios, a new methodology that decouples model training from architecture search. Instead of training a specialized model for each case, we propose to train a once-for-all network that supports diverse architectural settings (depth, width, kernel size, and resolution). Given a deployment scenario, we can later search a specialized sub-network by selecting from the once-for-all network without training. As such, the training cost of specialized models is reduced from O(N) to O(1). However, it's challenging to prevent interference between many sub-networks. Therefore we propose the progressive shrinking algorithm, which is capable of training a once-for-all network to support more than 1019 sub-networks while maintaining the same accuracy as independently trained networks, saving the non-recurring engineering (NRE) cost….

DeepPrivacy: A Generative Adversarial Network for Face Anonymization

Abstract: We propose a novel architecture which is able to automatically anonymize faces in images while retaining the original data distribution. We ensure total anonymization of all faces in an image by generating images exclusively on privacy-safe information. Our model is based on a conditional generative adversarial network, generating images considering the original pose and image background. The conditional information enables us to generate highly realistic faces with a seamless transition between the generated face and the existing background. Furthermore, we introduce a diverse dataset of human faces, including unconventional poses, occluded faces, and a vast variability in backgrounds. Finally, we present experimental results reflecting the capability of our model to anonymize images while preserving the data distribution, making the data suitable for further training of deep learning models. As far as we know, no other solution has been proposed that guarantees the anonymization of faces while generating realistic images.