Deep Learning

1 downloads 0 Views 20MB Size Report
Deep Learning – An interdisciplinary View of. Learning ...... Google is taking over through their newly released library TensorFlow. → ... Written mainly in C++.
www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Deep  Learning  –  An  interdisciplinary  View  of  Learning  Algorithms  for  Remote  Sensing  Image Analysis 

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Who am I ? ➢

Dimitrios Marmanis

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Who am I ? ➢



Dimitrios Marmanis PhD Candidate ➢ German Aerospace Center → DLR-IMF ➢ Technical University of Munich → TUM

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Who am I ? ➢





Dimitrios Marmanis PhD Candidate ➢ German Aerospace Center → DLR-IMF ➢ Technical University of Munich → TUM Supervisors ➢ U. Stilla - TUM ➢ M. Datcu - DLR-IMF ➢ K. Schindler - ETHZ

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Who am I ? ➢







Dimitrios Marmanis PhD Candidate ➢ German Aerospace Center → DLR-IMF ➢ Technical University of Munich → TUM Supervisors ➢ U. Stilla - TUM ➢ M. Datcu - DLR-IMF ➢ K. Schindler - ETHZ Interests ➢ Deep Learning application on EO Data ➢ Advancement of Deep Learning Models

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Who am I ? ➢









Dimitrios Marmanis PhD Candidate ➢ German Aerospace Center → DLR-IMF ➢ Technical University of Munich → TUM Supervisors ➢ U. Stilla - TUM ➢ M. Datcu - DLR-IMF ➢ K. Schindler - ETHZ Interests ➢ Deep Learning application on EO Data ➢ Advancement of Deep Learning Models Focus ➢ Machine Learning, ➢ Remote Sensing ➢ Computer Vision

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Presentation Outline

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Presentation Outline ➢

Brief Introduction to Deep Learning & CNN

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Presentation Outline ➢

Brief Introduction to Deep Learning & CNN



Notable Breakthroughs in Computer Vision

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Presentation Outline ➢

Brief Introduction to Deep Learning & CNN



Notable Breakthroughs in Computer Vision



Important Findings in Remote Sensing & Astronomy

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Presentation Outline ➢

Brief Introduction to Deep Learning & CNN



Notable Breakthroughs in Computer Vision



Important Findings in Remote Sensing & Astronomy



Intriguing Properties of CNNs

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Presentation Outline ➢

Brief Introduction to Deep Learning & CNN



Notable Breakthroughs in Computer Vision



Important Findings in Remote Sensing & Astronomy



Intriguing Properties of CNNs



How to Get Into Deep Learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Brief Introduction to Deep Learning & CNN

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

What is Deep­Learning ? Multiple definitions, however they all agree in the following aspects:

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

What is Deep­Learning ? Multiple definitions, however they all agree in the following aspects: ➢

Multiple layers of processing units

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

What is Deep­Learning ? Multiple definitions, however they all agree in the following aspects: ➢

Multiple layers of processing units



End-to-end automatic learning features

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

What is Deep­Learning ? Multiple definitions, however they all agree in the following aspects: ➢

Multiple layers of processing units



End-to-end automatic learning features



Hierarchical feature representation → From low to high-level abstraction

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

What is Deep­Learning ? Multiple definitions, however they all agree in the following aspects: ➢

Multiple layers of processing units



End-to-end automatic learning features



Hierarchical feature representation → From low to high-level abstraction



Supervised or Unsupervised frameworks exist

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Traditional­Methods Vs. Deep­Learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Traditional­Methods Vs. Deep­Learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Traditional­Methods Vs. Deep­Learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Historical Evolution of  Deep Learning Models

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Historical Evolution of  Deep Learning Models

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

How CNNs Work ?  

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

How CNNs Work ?  



Architecture designed for processing data with spatial consistency – local trainable kernels

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

How CNNs Work ?  





Architecture designed for processing data with spatial consistency – local trainable kernels Learn hierarchical representations → depth of network

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

How CNNs Work ?  







Architecture designed for processing data with spatial consistency – local trainable kernels Learn hierarchical representations → depth of network Efficient for large images extents– local computations through shared weights (trainable kernels)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

How CNNs Work ?  







Architecture designed for processing data with spatial consistency – local trainable kernels Learn hierarchical representations → depth of network Efficient for large images extents– local computations through shared weights (trainable kernels)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The First Breakthrough ­ 2012 ➢

AlexNet

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The First Breakthrough ­ 2012 ➢

AlexNet : Large CNN net → Won 2012 ImageNet Large-Scale Visual Recognition Challenge (= Olympics of Computer Vision)

Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The First Breakthrough ­ 2012 ➢



AlexNet : Large CNN net → Won 2012 ImageNet Large-Scale Visual Recognition Challenge (= Olympics of Computer Vision) First time CNN breakthrough in a “real-problem” task

Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The First Breakthrough ­ 2012 ➢

AlexNet : Large CNN net → Won 2012 ImageNet Large-Scale Visual Recognition Challenge (= Olympics of Computer Vision)



First time CNN breakthrough in a “real-problem” task



CNN error-rate achiever 15.4% → second best entry had 26.2% error-rate

Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The First Breakthrough ­ 2012 ➢

AlexNet : Large CNN net → Won 2012 ImageNet Large-Scale Visual Recognition Challenge (= Olympics of Computer Vision)



First time CNN breakthrough in a “real-problem” task



CNN error-rate achiever 15.4% → second best entry had 26.2% error-rate



Trained on 2 GPUs for ~ 5 days

Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The First Breakthrough ­ 2012 ➢

AlexNet : Large CNN net → Won 2012 ImageNet Large-Scale Visual Recognition Challenge (= Olympics of Computer Vision)



First time CNN breakthrough in a “real-problem” task



CNN error-rate achiever 15.4% → second best entry had 26.2% error-rate



Trained on 2 GPUs for ~ 5 days



learning content from 15 million images

Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Notable Breakthroughs in Computer Vision

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Object Classification and Detection in Photos

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Object Classification and Detection in Photos ➢

Train a CNN on many millions of images examples

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Object Classification and Detection in Photos ➢



Train a CNN on many millions of images examples Current systems achieve superhuman performance (=5.1 %) → error rate of 3.57 % in the classification task

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Object Classification and Detection in Photos ➢





Train a CNN on many millions of images examples Current systems achieve superhuman performance (=5.1 %) → error rate of 3.57 % in the classification task Knowledge acquired during training is transferable to a plethora of different visionrelated tasks (transfer-learning)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Object Classification and Detection in Photos ➢





Train a CNN on many millions of images examples Current systems achieve superhuman performance (=5.1 %) → error rate of 3.57 % in the classification task Knowledge acquired during training is transferable to a plethora of different visionrelated tasks (transfer-learning)

Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. He, K., Zhang, X., Ren, S., & Sun, J. (2015). Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385.

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Grayscale Image Colorization

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Grayscale Image Colorization ➢

Colorize multimedia-images & video frames

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Grayscale Image Colorization ➢



Colorize multimedia-images & video frames Impressive results for images similar to ImageNet, but not necessarily for every type of image

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Grayscale Image Colorization ➢



Colorize multimedia-images & video frames Impressive results for images similar to ImageNet, but not necessarily for every type of image

Zhang, R., Isola, P., & Efros, A. A. (2016). Colorful Image Colorization. arXiv preprint arXiv:1603.08511. http://richzhang.github.io/colorization/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Handwriting Generation

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Handwriting Generation ➢

Learn the relationship between the pen movement (coordinates) and and respective letters

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Handwriting Generation ➢



Learn the relationship between the pen movement (coordinates) and and respective letters Through gained knowledge new text can be generated on the fly using a learned style

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Handwriting Generation ➢



Learn the relationship between the pen movement (coordinates) and and respective letters Through gained knowledge new text can be generated on the fly using a learned style

Graves, A. (2013). Generating sequences with recurrent neural networks.arXiv preprint arXiv:1308.0850. http://www.cs.toronto.edu/~graves/handwriting.html

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Caption Generation ➢

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Caption Generation ➢

Generate coherent sentences describing the image content

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Caption Generation ➢



Generate coherent sentences describing the image content Method : Break the problem into parts → Object detection with CNNs and sentence generation with LSTMs (Long-Short Term Memory Networks)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Caption Generation ➢





Generate coherent sentences describing the image content Method : Break the problem into parts → Object detection with CNNs and sentence generation with LSTMs (Long-Short Term Memory Networks) Work on images and video as well

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Automatic Caption Generation ➢





Generate coherent sentences describing the image content Method : Break the problem into parts → Object detection with CNNs and sentence generation with LSTMs (Long-Short Term Memory Networks) Work on images and video as well

Karpathy, A., & Fei-Fei, L. (2015). Deep visual-semantic alignments for generating image descriptions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3128-3137). http://cs.stanford.edu/people/karpathy/deepimagesent/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Image and Video Style Transfer

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Image and Video Style Transfer ➢

Image Style Transfer (texture transfer)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Image and Video Style Transfer ➢



Image Style Transfer (texture transfer) Main intuition : Retain structure of image (contect) and “superimpose” a particular style (texture)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Image and Video Style Transfer ➢





Image Style Transfer (texture transfer) Main intuition : Retain structure of image (contect) and “superimpose” a particular style (texture) Used for artistic style transfer (paintings) and Photorealistic image styling

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Image and Video Style Transfer ➢





Image Style Transfer (texture transfer) Main intuition : Retain structure of image (contect) and “superimpose” a particular style (texture) Used for artistic style transfer (paintings) and Photorealistic image styling

Gatys, L. A., Ecker, A. S., & Bethge, M. (2015). A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576. Gatys, L. A., Ecker, A. S., & Bethge, M. (2016). Image style transfer using convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2414-2423). http://www.genekogan.com/works/style-transfer.html

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Detecting Location of Image on a Global Scale

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Detecting Location of Image on a Global Scale ➢

Train a large CNN images and respective location on the world (Grid-like location)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Detecting Location of Image on a Global Scale ➢

Train a large CNN images and respective location on the world (Grid-like location)



Training dataset ~91 million images and respective locations

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Detecting Location of Image on a Global Scale ➢

Train a large CNN images and respective location on the world (Grid-like location)



Training dataset ~91 million images and respective locations



Test the model on 2.3 million images (Flickr) – indoor & outdoor scenes

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Detecting Location of Image on a Global Scale ➢

Train a large CNN images and respective location on the world (Grid-like location)



Training dataset ~91 million images and respective locations



Test the model on 2.3 million images (Flickr) – indoor & outdoor scenes



Tested again human – model achieves superhuman performance

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Detecting Location of Image on a Global Scale ➢

Train a large CNN images and respective location on the world (Grid-like location)



Training dataset ~91 million images and respective locations



Test the model on 2.3 million images (Flickr) – indoor & outdoor scenes



Tested again human – model achieves superhuman performance

Weyand, T., Kostrikov, I., & Philbin, J. (2016). Planet-photo geolocation with convolutional neural networks. arXiv preprint arXiv:1602.05314. https://www.geoguessr.com/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Important Findings in Remote Sensing &  Astronomy

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs ➢

Proposed Model → two-step approach for semantic classification of RS images

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs ➢



Proposed Model → two-step approach for semantic classification of RS images Approach : feature extraction from a pre-trained CNN model (Overfeat) & with a trainable CNN model on top

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs ➢





Proposed Model → two-step approach for semantic classification of RS images Approach : feature extraction from a pre-trained CNN model (Overfeat) & with a trainable CNN model on top Pre-trained model generating the feature descriptors from ImageNet dataset– no knowledge of remote sensing images

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs ➢







Proposed Model → two-step approach for semantic classification of RS images Approach : feature extraction from a pre-trained CNN model (Overfeat) & with a trainable CNN model on top Pre-trained model generating the feature descriptors from ImageNet dataset– no knowledge of remote sensing images Test performance on UC-Merced Landuse classification benchmark → 21 semantic classes like sparse residential, medium residential, buildings, tennis-fields etc.

➢ ➢ ➢ ➢ ➢

Marmanis, D., Datcu, M., Esch, T., & Stilla, U. (2016). Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks. IEEE Geoscience and Remote Sensing Letters, 13(1), 105-109.

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs ➢







Proposed Model → two-step approach for semantic classification of RS images Approach : feature extraction from a pre-trained CNN model (Overfeat) & with a trainable CNN model on top Pre-trained model generating the feature descriptors from ImageNet dataset– no knowledge of remote sensing images Test performance on UC-Merced Landuse classification benchmark → 21 semantic classes like sparse residential, medium residential, buildings, tennis-fields etc.

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs

Marmanis, D., Datcu, M., Esch, T., & Stilla, U. (2016). Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks. IEEE Geoscience and Remote Sensing Letters, 13(1), 105-109.

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

RS Image Classification with Pre­Trained CNNs

Marmanis, D., Datcu, M., Esch, T., & Stilla, U. (2016). Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks. IEEE Geoscience and Remote Sensing Letters, 13(1), 105-109.

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs ➢

Use an ensemble of pre-trained Computer Vision models for annotating Remotely Sensed data → resolution 5 to 9cm/ pixel

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs ➢



Use an ensemble of pre-trained Computer Vision models for annotating Remotely Sensed data → resolution 5 to 9cm/ pixel Data intensities have extensive intra-class variability with an overall decreased interclass separation – increased importance of topology and context

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs ➢





Use an ensemble of pre-trained Computer Vision models for annotating Remotely Sensed data → resolution 5 to 9cm/ pixel Data intensities have extensive intra-class variability with an overall decreased interclass separation – increased importance of topology and context We proved that different pre-trained models over the same architecture result in complementary outcomes-→ when combined achieve superior performance

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs ➢







Use an ensemble of pre-trained Computer Vision models for annotating Remotely Sensed data → resolution 5 to 9cm/ pixel Data intensities have extensive intra-class variability with an overall decreased interclass separation – increased importance of topology and context We proved that different pre-trained models over the same architecture result in complementary outcomes-→ when combined achieve superior performance Structured models seem not to improve results – CNN results in structured predictions

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs ➢







Use an ensemble of pre-trained Computer Vision models for annotating Remotely Sensed data → resolution 5 to 9cm/ pixel Data intensities have extensive intra-class variability with an overall decreased interclass separation – increased importance of topology and context We proved that different pre-trained models over the same architecture result in complementary outcomes-→ when combined achieve superior performance Structured models seem not to improve results – CNN results in structured predictions



Marmanis, D., Wegner, J. D., Galliani, S., Schindler, K., Datcu, M., & Stilla, U. (2016). Semantic segmentation of aerial images with an ensemble of CNNs. ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, 3

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs

Results on ISPRS Benchmark

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs

Results on ISPRS Benchmark

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Semantic Annotation of VHSR Image using CNNs

Results on ISPRS Benchmark

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Classification of Galaxies Based on Morphology 

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Classification of Galaxies Based on Morphology  ➢

Competition with open open source code → Galaxy Zoo Challenge

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Classification of Galaxies Based on Morphology  ➢

Competition with open open source code → Galaxy Zoo Challenge



Task: Human-like performance on classification of galaxies → 37 galaxy classes

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Classification of Galaxies Based on Morphology  ➢

Competition with open open source code → Galaxy Zoo Challenge



Task: Human-like performance on classification of galaxies → 37 galaxy classes



Labels where acquired through crowdsourcing project → Galaxy Zoo website

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Classification of Galaxies Based on Morphology  ➢

Competition with open open source code → Galaxy Zoo Challenge



Task: Human-like performance on classification of galaxies → 37 galaxy classes



Labels where acquired through crowdsourcing project → Galaxy Zoo website



Best Model CNN – 7 layers (4 convolutional & fully connected) - No use of pretrained model (this aspect was not investigated)

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Classification of Galaxies Based on Morphology  ➢

Competition with open open source code → Galaxy Zoo Challenge



Task: Human-like performance on classification of galaxies → 37 galaxy classes



Labels where acquired through crowdsourcing project → Galaxy Zoo website





Best Model CNN – 7 layers (4 convolutional & fully connected) - No use of pretrained model (this aspect was not investigated) RMSE → 0.07492 → Highly accurate model

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

  

Intriguing Property of CNNs

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Network Initialization through Tranfer­Learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Network Initialization through Tranfer­Learning



In various cases no need to train a network from scratch (random initialization) → Use of an rich information pre-trained network may provide better results and reduced training time

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Network Initialization through Tranfer­Learning





In various cases no need to train a network from scratch (random initialization) → Use of an rich information pre-trained network may provide better results and reduced training time Pre-trained models → state-of-the art performance in a variety of vision related tasks

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Network Initialization through Tranfer­Learning







In various cases no need to train a network from scratch (random initialization) → Use of an rich information pre-trained network may provide better results and reduced training time Pre-trained models → state-of-the art performance in a variety of vision related tasks Plethora of freely available, ready to use pre-trained models

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Network Initialization through Tranfer­Learning









In various cases no need to train a network from scratch (random initialization) → Use of an rich information pre-trained network may provide better results and reduced training time Pre-trained models → state-of-the art performance in a variety of vision related tasks Plethora of freely available, ready to use pre-trained models Same network with different initialization and/or different architecture will probably produce slightly different results – > non-convex feature space

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Model­Zoo : An Open Repository for CNN Pre­trained  models 

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Model­Zoo : An Open Repository for CNN Pre­trained  models  ➢

Online repository with dozens of CNN pre-trained model

https://github.com/BVLC/caffe/wiki/Model-Zoo https://bitbucket.org/deeplab/deeplab-public/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Model­Zoo : An Open Repository for CNN Pre­trained  models  ➢



Online repository with dozens of CNN pre-trained model Model specialization vary → visual classification, image similarity, robotics, speech, 3D reconstruction, contour-detection, etc.

https://github.com/BVLC/caffe/wiki/Model-Zoo https://bitbucket.org/deeplab/deeplab-public/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Model­Zoo : An Open Repository for CNN Pre­trained  models  ➢





Online repository with dozens of CNN pre-trained model Model specialization vary → visual classification, image similarity, robotics, speech, 3D reconstruction, contour-detection, etc. Standardize format for easy share and use –> through Caffe Library

https://github.com/BVLC/caffe/wiki/Model-Zoo https://bitbucket.org/deeplab/deeplab-public/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Model­Zoo : An Open Repository for CNN Pre­trained  models  ➢



Online repository with dozens of CNN pre-trained model Model specialization vary → visual classification, image similarity, robotics, speech, 3D reconstruction, contour-detection, etc.



Standardize format for easy share and use –> through Caffe Library



Unrestricted use (BVLC license)

https://github.com/BVLC/caffe/wiki/Model-Zoo https://bitbucket.org/deeplab/deeplab-public/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Model­Zoo : An Open Repository for CNN Pre­trained  models  ➢



Online repository with dozens of CNN pre-trained model Model specialization vary → visual classification, image similarity, robotics, speech, 3D reconstruction, contour-detection, etc.



Standardize format for easy share and use –> through Caffe Library



Unrestricted use (BVLC license)



Ready run with minimal effort https://github.com/BVLC/caffe/wiki/Model-Zoo https://bitbucket.org/deeplab/deeplab-public/

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

  

How to Get Into Deep Learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Software Libraries

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Software Libraries ➢

More than 50 different libraries

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Software Libraries ➢



More than 50 different libraries Each software has a different approach and scope → eg. scientific experimentation, application-oriented, easy of use, etc.

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Software Libraries ➢





More than 50 different libraries Each software has a different approach and scope → eg. scientific experimentation, application-oriented, easy of use, etc. Google is taking over through their newly released library → TensorFlow

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Software Libraries ➢



More than 50 different libraries Each software has a different approach and scope → eg. scientific experimentation, application-oriented, easy of use, etc.



Google is taking over through their newly released library → TensorFlow



There are a few alternatives that won’t die soon → support from IBM and Facebook

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Software Libraries ➢



More than 50 different libraries Each software has a different approach and scope → eg. scientific experimentation, application-oriented, easy of use, etc.



Google is taking over through their newly released library → TensorFlow



There are a few alternatives that won’t die soon → support from IBM and Facebook



Import to think before you make a choice

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Three Most Important Software Libraries

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Three Most Important Software Libraries ➢

Caffe ➢ Written mainly in C++ ➢ Bindings in Matlab & Python ➢ Mainly targeting vision problems ➢ Very fast and modular

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Three Most Important Software Libraries ➢



Caffe ➢ Written mainly in C++ ➢ Bindings in Matlab & Python ➢ Mainly targeting vision problems ➢ Very fast and modular TensorFlow / Theano ➢ Symbolic expression compiler ➢ Allows automatic differentiation ➢ Symbolic flow graphs (TensorFlow) ➢ Distributed computation (TensorFlow) ➢ Most popular library – supported by Google

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Three Most Important Software Libraries ➢





Caffe ➢ Written mainly in C++ ➢ Bindings in Matlab & Python ➢ Mainly targeting vision problems ➢ Very fast and modular TensorFlow / Theano ➢ Symbolic expression compiler ➢ Allows automatic differentiation ➢ Symbolic flow graphs (TensorFlow) ➢ Distributed computation (TensorFlow) ➢ Most popular library – supported by Google Torch ➢ Matlab-like environment – LuaJIT ➢ Very advanced framework ➢ Can allow changes over the models on the fly

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

Further Reading on Deep Learning Very reach online repository ➢ ➢ ➢ ➢ ➢

Books Important publications Courses Video lectures Tutorials https://github.com/priyaank/deep-learning

www.DLR.de • IMF

BigSkyEarth – Sorrento October 2016

The End

Questions ???