Abstract
Developing a deep learning application requires unique approaches and methods. These are discussed in two industrial applications. The architectures used are two classic ones: convolutional neural network (CNN) and long short-term memory (LSTM). The CNN application to identify hand-written choice options from a scanned image, faced issues of “pose” (oriental transformation) problem and “region segmentation.” The LSTM application to predict the equated monthly instalment by all customers had to overcome challenges of “multi-variate” and “multiple time series.” The evolution of a satisfactory solution for the customer in both these cases is described in this chapter. The importance of data analysis, data pre-processing and proper choice of hyper-parameters is seen through these experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Raj, B. (2018, May). A simple guide to the versions of the inception network. Medium. https://towardsdatascience.com/a-simple-guide-to-the-versions-of-the-inception-network-7fc52b863202.
Colah blog. (2019, December). Understanding LSTM Networks. http://colah.github.io/posts/2015-08-understanding-LSTMs/.
Deshpande Adit. The-9-Deep-Learning-Papers-You-Need-To-Know-About https://adeshpande3.github.io/The-9-Deep-Learning-Papers-You-Need-To-Know-About.html.
Elton, D. (2017). http://moreisdifferent.com/2017/09/hinton-whats-wrong-with-CNNs.
Andrej Karpathy blog. (2015, May). The unreasonable effectiveness of recurrent neural networks. http://karpathy.github.io/2015/05/21/rnn-effectiveness/.
Krizhevsky, A., Sutskever, I., & Geoffrey, H. (2017). Imagenet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84–90. https://doi.org/10.1145/3065386. ISSN 0001-0782.
Szegedy, C., et al. (2015). Going deeper with convolutions. CVP Computer Vision Foundation.
Uijilings, J. R. R., et al. (2012). Selective search for object recognition. Technical report, 2012 IJCV.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Balakrishnan, A. (2021). Challenges and Solutions in Developing Convolutional Neural Networks and Long Short-Term Memory Networks for Industry Problems. In: Pandey, M., Rautaray, S.S. (eds) Machine Learning: Theoretical Foundations and Practical Applications. Studies in Big Data, vol 87. Springer, Singapore. https://doi.org/10.1007/978-981-33-6518-6_2
Download citation
DOI: https://doi.org/10.1007/978-981-33-6518-6_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6517-9
Online ISBN: 978-981-33-6518-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)