Deep Learning on Big Data
Deep Learning is a family of machine learning methods based on the composition of multiple (simple) processing layers, in order to learn complex (nonlinear) functions, which can be used to cope with many challenging application scenarios (e.g., Computer Vision, Natural Language Processing, Speech Recognition and Genomics (Le Cun et al. 2015)).
In the last few years, advances in digital sensors, computation, communications, and storage technologies and the large diffusion of IoT devices facilitated the production of huge collections of heterogeneous data, which are also susceptible to rapid changes over time. The term Big Data is used to define this kind of phenomenon (Wu et al. 2014).
Deep learning (DL) includes a wide range of machine learning techniques that aims at training artificial neural networks (ANNs) composed of a large number of hidden layers. These methods have been effectively used to tackle different types of problems (e.g., Computer Vision, Natural...
- Bengio Y (2012) Deep learning of representations for unsupervised and transfer learning. In: Proceedings of ICML workshop on unsupervised and transfer learning, pp 17–36Google Scholar
- Byeon W, Breuel TM, Raue F, Liwicki M (2015) Scene labeling with LSTM recurrent neural networks. In: CVPR, IEEE Computer Society, pp 3547–3555Google Scholar
- Chandar APS, Lauly S, Larochelle H, Khapra MM, Ravindran B, Raykar V, Saha A (2014) An autoencoder approach to learning bilingual word representations. In: Proceedings of the 27th international conference on neural information processing systems – vol 2, NIPS’14. MIT Press, Cambridge, MA, pp 1853–1861, http://dl.acm.org/citation.cfm?id=2969033.2969034Google Scholar
- Dahl G, Ranzato M, rahman Mohamed A, Hinton GE (2010) Phone recognition with the mean-covariance restricted Boltzmann machine. In: Lafferty JD, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A (eds) Advances in neural information processing systems 23. Curran Associates, Inc., pp 469–477Google Scholar
- Glorot X, Bordes A, Bengio Y (2011) Domain adaptation for large-scale sentiment classification: a deep learning approach. In: Proceedings of the twenty-eight international conference on machine learning, ICMLGoogle Scholar
- Graves A, Fernández S, Liwicki M, Bunke H, Schmidhuber J (2007) Unconstrained online handwriting recognition with recurrent neural networks. In: Proceedings of the 20th international conference on neural information processing systems, NIPS’07. Curran Associates Inc., pp 577–584Google Scholar
- Huang PS, He X, Gao J, Deng L, Acero A, Heck L (2013) Learning deep structured semantic models for web search using clickthrough data. In: Proceedings of the 22nd ACM international conference on conference on information & knowledge management. ACM, pp 2333–2338Google Scholar
- Jia S, Lansdall-Welfare T, Cristianini N (2016) Gender classification by deep learning on millions of weakly labelled images. In: 2016 IEEE 16th international conference on data mining workshops (ICDMW). IEEE, pp 462–467Google Scholar
- Lebret R, Collobert R (2015) “The sum of its parts”: joint learning of word and phrase representations with autoencoders. CoRR abs/1506.05703Google Scholar
- Lv Y, Duan Y, Kang W, Li Z, Wang FY (2015) Traffic flow prediction with big data: a deep learning approach. IEEE Trans Intell Transp Syst 16(2):865–873Google Scholar
- Paula EL, Ladeira M, Carvalho RN, Marzagão T (2016) Deep learning anomaly detection as support fraud investigation in Brazilian exports and anti-money laundering. In: 2016 15th IEEE international conference on machine learning and applications (ICMLA). IEEE, pp 954–960Google Scholar
- Shen Y, He X, Gao J, Deng L, Mesnil G (2014) Learning semantic representations using convolutional neural networks for web search. In: Proceedings of the 23rd international conference on world wide web. ACM, pp 373–374Google Scholar
- Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: Proceedings of the 27th international conference on neural information processing systems – vol 2, NIPS’14. MIT Press, Cambridge, MA, pp 3104–3112. http://dl.acm.org/citation.cfm?id=2969033.2969173Google Scholar
- Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Computer vision and pattern recognition (CVPR)Google Scholar
- Torrey L, Shavlik J (2009) Transfer learning. In: Soria E, Martin J, Magdalena R, Martinez M, Serrano A (eds) Handbook of research on machine learning applications and trends: algorithms, methods, and techniques, vol 1. IGI Global, Hershey, p 242Google Scholar
- Vincent P, Larochelle H, Bengio Y, Manzagol P (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on machine learning, ICML’08, pp 1096–1103Google Scholar
- Wu Y, Wang Z, Ji Q (2013) Facial feature tracking under varying facial expressions and face poses based on restricted Boltzmann machines. In: 2013 IEEE conference on computer vision and pattern recognition, Portland, 23–28 June 2013, pp 3452–3459. https://doi.org/10.1109/CVPR.2013.443
- Zhang S, Yang M, Wang X, Lin Y, Tian Q (2013) Semantic-aware co-indexing for image retrieval. In: Proceedings of the IEEE international conference on computer vision, pp 1673–1680Google Scholar