Abstract
Large Social Commerce Network sites like eBay have to constantly grapple with building scalable machine learning algorithms for search ranking, recommender systems, classification, and others. Large data availability is both a boon and curse. While it offers a lot more diverse observation, the same diversity with sparsity and lack of reliable labeled data at scale introduces new challenges. Also, availability of large data helps take advantage of correlational factors while requiring creativity in discarding irrelevant data. In this talk we will discuss all of this and more from the context of eBays large data problems.
Chapter PDF
Similar content being viewed by others
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sundaresan, N. (2011). Data Science and Machine Learning at Scale. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2011. Lecture Notes in Computer Science(), vol 6911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23780-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-23780-5_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23779-9
Online ISBN: 978-3-642-23780-5
eBook Packages: Computer ScienceComputer Science (R0)