Abstract
English and Chinese language frequency time series (LFTS) were constructed based on an English and two Chinese novels. Methods of statistical hypothesis testing were adopted to test the nonlinear properties of the LFTS. Results suggest the series exhibited non-normal, auto-correlative, and stationary characteristics. Moreover, we found that LFTS follow the power law distributions, and thereby we investigated the fractal structure, long range correlation, and intermittency, which indicated the self-similarity features of LFTS, and also provided hints that human societies are likely to share some universal properties.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Zipf G K. Human Behavior and the Principle of Least Effort. Reading, MA: Addision-Wesley, 1949
Calderia S M G, Lobao T C P, Andrade R F S, et al. The network of concepts in written texts. Eur Phys J B, 2006, 49: 523–529
Masucci A P, Rodgers G J. Network properties of written human language. Phys Rev E, 2006, 76: 026102
Li J Y, Zhou J. Chinese character structure analysis based on complex networks. Physica A, 2007, 380: 629–638
Liu H T. Statistical properties of Chinese semantic networks. Chinese Sci Bull, 2009, 54: 2781–2785
Liu H T, Li W W. Language clusters based on linguistic complex networks. Chinese Sci Bull, 2010, 55: 3458–3465
Yu S Y, Chen Y D, Wu J R. Spectral integration and perception of Chinese back vowel /Γ/. Sci China Inf Sci, 2010, 53: 2300–2309
Schenkel A, Zhang J, Zhang Y C. Long range correlation in human writtings. Fractals, 1993, 1: 47–57
Amit M, Shemerler Y, Eisenberg E, et al. Language and codification dependence of long range correlation in texts. Fractals, 1994, 2: 7–13
Ausloos M. Equilibrium and dynamic methods when comparing an English text and its Esperanto translation. Physica A, 2008, 387: 6411–6420
Kosmidis K, Kalampokis A, Argyrakis P. Language time series analysis. Physica A, 2006, 370: 808–816
Carlos M J, Anil K B. Efficient tests for normality, homoscedasticity and serial independence of regression residuals. Econom Lett, 1980, 6: 255–259
Baum C F. An Introduction to Modern Econometrics Using Stata. College Station, Texas: Stata Press, 2006
Ljung G M, Box G E P. On a measure of a lack of fit in time series models. Biometrika, 1978, 65: 297–303
Mills T C. Time Series Techniques for Economists. New York: Cambridge University Press, 1990
Chen Y Z, He D R. A study on some urban bus transport networks. Physica A, 2007, 376: 747–754
Martinis M, Knezevic A, Krstacic G, et al. Changes in the Hurst exponent of heartbeat intervals during physical activity. Phys Rev E, 2004, 70: 012903
Carbonea A, Castelli G, Stanley H E. Time-dependent Hurst exponent in financial time series. Physica A, 2004, 344: 267–271
Peng C K, Buldyrev S V, Havlin S, et al. Mosaic organization of DNA nucleotides. Phys Rev E, 1994, 49: 1685–1689
Ivanova K, Ausloos M. Application of the detrended fluctuation analysis (DFA) method for describing cloud breaking. Physica A, 1999, 274: 349–354
Author information
Authors and Affiliations
Corresponding author
Additional information
This article is published with open access at Springerlink.com
Rights and permissions
This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.
About this article
Cite this article
Deng, W., Wang, D., Li, W. et al. English and Chinese language frequency time series analysis. Chin. Sci. Bull. 56, 3717–3722 (2011). https://doi.org/10.1007/s11434-011-4752-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11434-011-4752-0