Difference between revisions of "GP SSM"

From Robotics
Jump to: navigation, search
(Papers on Deep Learning Theory)
(Papers on Deep Learning Theory)
Line 47: Line 47:
 
=== Papers on Deep Learning ''Theory'' ===
 
=== Papers on Deep Learning ''Theory'' ===
 
* This is the paper that we seek to understand in this reading group.
 
* This is the paper that we seek to understand in this reading group.
** A. Achile and S. Soatto, [[Media:InvarianceDisentanglement.pdf | Emergence of Invariance and Disentanglement in Deep Representations]], ''arXiV:1706.01350v2'', Oct. 2017.
+
** A. Achille and S. Soatto, [[Media:InvarianceDisentanglement.pdf | Emergence of Invariance and Disentanglement in Deep Representations]], ''arXiV:1706.01350v2'', Oct. 2017.
* Here are some background papers
+
* Here are some background papers (i.e., links to many of the references in the Achille and Soatto paper)
 
** A. Achille and S. Soatto [https://arxiv.org/pdf/1611.01353.pdf  Information Dropout: Learning Optimal Representations Through Noisy Computation]
 
** A. Achille and S. Soatto [https://arxiv.org/pdf/1611.01353.pdf  Information Dropout: Learning Optimal Representations Through Noisy Computation]
 
** A. Alemi, I. Fischer, K. Dillon, and K. Murphey, [https://arxiv.org/abs/1612.00410 Deep Variational Information Bottleneck], ''arXiV:1612.00410''
 
** A. Alemi, I. Fischer, K. Dillon, and K. Murphey, [https://arxiv.org/abs/1612.00410 Deep Variational Information Bottleneck], ''arXiV:1612.00410''
Line 54: Line 54:
 
** Y. Bengio [https://www.iro.umontreal.ca/~lisa/pointeurs/TR1312.pdf Learning Deep Architectures for AI], 2009.
 
** Y. Bengio [https://www.iro.umontreal.ca/~lisa/pointeurs/TR1312.pdf Learning Deep Architectures for AI], 2009.
 
** J. Bruna and S. Mallat, [http://www.cmap.polytechnique.fr/scattering/scattering_cvpr2011.pdf Classification with Scattering Operators], ''CVPR'', 2011.
 
** J. Bruna and S. Mallat, [http://www.cmap.polytechnique.fr/scattering/scattering_cvpr2011.pdf Classification with Scattering Operators], ''CVPR'', 2011.
** P. Chaudhari, A. Choromanska, S. Soatto, Y. LeCun, C. Baldassi, C. Borgs, J. Chayes, L. Sagun, R. Zecchina [https://arxiv.org/abs/1611.01838 Entropy-SGD: Biasing Gradient Descent into Wide Valleys], ''Proceedings of the International Conference on Learning Representations (ICLR)'', 2017
+
** P. Chaudhari, A. Choromanska, S. Soatto, Y. LeCun, C. Baldassi, C. Borgs, J. Chayes, L. Sagun, R. Zecchina [https://arxiv.org/abs/1611.01838 Entropy-SGD: Biasing Gradient Descent into Wide Valleys], ''Proceedings of the International Conference on Learning Representations (ICLR)'', 2017.
 
** N. Tishby and N. Zaslavsky, [[Media:DeepLearningBottleneck.pdf | Deep Learning and the Information Bottleneck Principle]], ''arXiv:1503.02406'', March, 2015.
 
** N. Tishby and N. Zaslavsky, [[Media:DeepLearningBottleneck.pdf | Deep Learning and the Information Bottleneck Principle]], ''arXiv:1503.02406'', March, 2015.
** D.-A. Clevert, T. Unterthiner, and S. Hochreiter, [https://arxiv.org/abs/1511.07289 Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)], ''arXiv:1511.07289''
+
** D.-A. Clevert, T. Unterthiner, and S. Hochreiter, [https://arxiv.org/abs/1511.07289 Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)], ''arXiv:1511.07289''.
 +
** L. Dinh, R. Pascanu, S. Bengio, Y. Bengio [https://arxiv.org/abs/1703.04933 Sharp Minima Can Generalize For Deep Nets], ''arXiv:1703.04933''
 +
 
 
* P. Chauderi, A.Oberman, S. Osher, S. Soatto, G. Ccarlier, [[Media:DeepNetPDEs.pdf | Deep Relaxation: Partial Differential Equations for Optimizing Deep Neural Networks]], ''arXiV:1704.04932v2''
 
* P. Chauderi, A.Oberman, S. Osher, S. Soatto, G. Ccarlier, [[Media:DeepNetPDEs.pdf | Deep Relaxation: Partial Differential Equations for Optimizing Deep Neural Networks]], ''arXiV:1704.04932v2''
  

Revision as of 21:33, 16 January 2018

This page gathers references and materials related to the study of

  • Gaussian Process (GP) State Space Models (SSM)
  • Deep Learning
  • Koopman Spectral Methods.

Gaussian Process Approaches

Basic Gaussian Process Info

  • Rasmussen and Williams

Web Links

Papers on GP-SSMs


Deep Learning

Papers on Deep Learning Theory

Review-like papers

Papers on Scattering Networks

Scattering Networks are proposed by Stephan Mallat (of wavelet fame) to understand why deep nets work so well.

Classics

Web Links

Koopman Spectral Method

Papers on Koopman Spectral methods

Papers which are more oriented toward control

Papers which are more oriented toward fluids

Some Early Papers

Other Papers