Abstract
I review the background and some recent trends of a particular scholarly information network, arXiv.org, and discuss some of its implications for new scholarly publication models. If we were to start from scratch today to design a quality-controlled archive and distribution system for scientific and technical information, it could take a very different form from what has evolved in the past decade from pre-existing print infrastructure. Near-term advances in automated classification systems, authoring tools, and document formats will facilitate efficient datamining and long-term archival stability, and I discuss how these could provide not only more efficient means of accessing and navigating the information, but also more cost-effective means of authentication and quality control. Finally, I illustrate the use of machine learning techniques to analyze, structure, maintain, and evolve a large online corpus of academic literature. An emerging field of research can be identified as part of an existing corpus, permitting the implementation of a more coherent community structure for its network of practitioners.
Preview
Unable to display preview. Download preview PDF.
References
1. P. Ginsparg. Winners and losers in the global research village. In Sir R. Elliot and D. Shaw, editors, Electronic Publishing in Science I, Proceedings of joint ICSU Press/UNESCO conference, Paris, 1996 (copy at http://arXiv.org/blurb/pg96unesco.html). URL: http://users.ox.ac.uk/ icsuinfo/ginsparg.htm.
2. P. Ginsparg. Creating a global knowledge network. In Sir R. Elliot and D. Shaw, editors, Electronic Publishing in Science II, proceedings of joint ICSU Press/UNESCO conference, Paris. ICSU Press, 2001 (copy at http://arXiv.org/blurb/pg01unesco.html). URL: http://users.ox.ac.uk/ icsuinfo/ginspargfin.htm.
3. P. Ginsparg. Can peer review be better focused?, 2003, Science & Technology Libraries, to appear. URL: http://arXiv.org/blurb/pg02pr.html.
4. Brinkman Outlines Priorities, Challenges for APS in 2002. APS News, January 2002. URL: http://www.aps.org/apsnews/0102/010208.html.
5. M. E. J. Newman, Who Is the Best Connected Scientist? A Study of Scientific Coauthorship Networks, Lect. Notes Phys. 650, 337–370 (2004).
6. R.S. Berry. Is electronic publishing being used in the best interests of science? the scientist’s view. In Sir R. Elliot and D. Shaw, editors, Electronic Publishing in Science II, proceedings of joint ICSU Press/UNESCO conference, Paris, 2001. URL: http://users.ox.ac.uk/~icsuinfo/berryfin.htm.
7. S. Bachrach, R.S. Berry, M. Blume, T. von Foerster, A. Fowler, P. Ginsparg, S. Heller, N. Kestner, A. Odlyzko, A. Okerson, R. Wigington, and A. Moffat. Who should own scientific papers? Science, 281:1459–1460, 1998.URL: http://www.sciencemag.org/cgi/content/full/281/5382/1459.
8. S. Rogers and C. Hurt. How scholarly communication should work in the 21st century. The Chronicle of Higher Education, October 18:A56, 1989.
9. P. Ginsparg. First steps towards electronic research communication. Computers in Physics, 8(4 (Jul/Aug)), 1994.
10. S. Gass. Transforming scientific communication for the 21st century. Science and Technology Libraries, 19(3/4):3–18, 2001.
11. D. Stern. eprint moderator model, 1999, version dated Jan 25, 1999.URL: http://www.library.yale.edu/scilib/modmodexplain.html.
12. R. Kling, L. Spector, and G. McKim. Locally controlled scholarly publishing via the internet: The guild model. The Journal of Electronic Publishing, August, 2002. URL: http://www.press.umich.edu/jep/08-01/kling.html.
13. J. Willinsky. Copyright contradictions in scholarly publishing. First Monday, 7(11 (November)), 2002. URL: http://firstmonday.org/issues/issue7_11/willinsky/index.html.
14. J. Kleinberg. Bursty and hierarchical structure in streams. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002. URL: http://www.cs.cornell.edu/home/kleinber/kleinber.html.
15. H. B. O’Connell. Physicists thriving with paperless publishing. HEP Lib. Web., 6:3, arXiv:physics/0007040, 2002. URL: http://arXiv.org/physics/0007040.
16. M. J. Kurtz, G. Eichhorn, A. Accomazzi, C. Grant, M. Demleitner, S. S. Murray, N. Martimbeau, and B. Elwell. The NASA astrophysics data system: Sociology, bibliometrics and impact. 2003. URL: http://cfa-www.harvard.edu/~kurtz/jasist-submitted.pdf.
17. P. Ginsparg, P. Houle, T. Joachims, and J.-H. Sul. Mapping subsets of scholarly information. PNAS, to appear, 2004. URL: http://arXiv.org/cs.IR/0312018.
18. P. Hayes and S. Weinstein. CONSTRUE/TIS: a system for content-based indexing of a database of news stories. In Annual Conference on Innovative Applications of AI, 1990.
19. G. Salton and C. Buckley. Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513–523, 1988.
20. M. Porter. An algorithm for suffix stripping. Program (Automated Library and Information Systems), 14(3):130–137, July 1980.
21. V. Vapnik. Statistical Learning Theory. Wiley, Chichester, GB, 1998.
22. T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the European Conference on Machine Learning, pages 137 – 142, Berlin, 1998. Springer. URL: http://www-ai.cs.uni-dortmund.de/DOKUMENTE/joachims_98a.ps.gz.
23. S. Dumais, J. Platt, D. Heckerman, and M. Sahami. Inductive learning algorithms and representations for text categorization. In Proceedings of ACM-CIKM98, November 1998.
24. T. Joachims. Learning to Classify Text Using Support Vector Machines – Methods, Theory, and Algorithms. Kluwer, 2002.
25. C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2):121–167, 1998.
26. N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, 2000.
27. I. Guyon, N. Matic, and V. Vapnik. Discovering informative patterns and data cleaning. In Advances in Knowledge Discovery and Data Mining, pages 181–203, 1996.
Author information
Authors and Affiliations
Editor information
Rights and permissions
About this chapter
Cite this chapter
Ginsparg, P. Scholarly Information Network. In: Ben-Naim, E., Frauenfelder, H., Toroczkai, Z. (eds) Complex Networks. Lecture Notes in Physics, vol 650. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44485-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-44485-5_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22354-2
Online ISBN: 978-3-540-44485-5
eBook Packages: Springer Book Archive