Abstract
Historical accounts (ca 17th–19th centuries ce) are valuable for understanding how and where past Native American cultures used fire as a land management tool. Previous research has compiled and interpreted accounts, but methods of compiling them remains less systematic, leaving open the possibility that undiscovered accounts exist. This study uses information retrieval methods to locate accounts of Native American burning within digitized historical documents. Utilizing known accounts from digitized documents, this research develops a model to rank text portions within unread documents based on their predicted relevance. The model used frequencies of key terms and related textual features as predictors, and the presence and absence of accounts within text portions as the dependent variable. Within 121 documents related to western New York State (NYS), USA, this study discovered 40 accounts including 28 in western NYS. Of accounts in western NYS, 12 accounts (describing 21 locations) made explicit connections to Native American burning, were resolvable to town-level or finer resolution, and were not derivative of other texts. To locate all known accounts, the model aided in reducing the amount of text to read to only 0.61% of total. Locations of burning were 0.0 to 16.2 km (median = 5.6 km) from the nearest Native American village area, and 0.1 to 12.5 km (median = 1.5 km) from the nearest trail. This study demonstrates how information retrieval can discover accounts of Native American burning, and suggests that undiscovered historical accounts exist that may advance historical, cultural, and ecological understandings of burning practices.
Similar content being viewed by others
References
Abadir ER, Marschall JM, Dey DC, Stambaugh MC (2019) Historical fire regimes in red pine forests of the Adirondack Mountains, New York, USA. Nat Area J 39:226–236. https://doi.org/10.3375/043.039.0209
Abrams MD, Nowacki GJ (2008) Native Americans as active and passive promoters of mast and fruit trees in the eastern USA. Holocene 18:1,123–1,137. https://doi.org/10.1177/0959683608095581
Alex B, Grover C, Klein E, Tobin R (2012) Digitised historical text: does it have to be mediOCRe? In: Proceedings of KONVENS 2012 (LThist 2012 workshop). Vienna, Austria, pp 401–409
Black BA, Abrams MD (2001) Influences of Native Americans and surveyor biases on metes and bounds witness-tree distribution. Ecology 82:2,574–2,586. https://doi.org/10.1890/0012-9658(2001)082%5b2574:ionaas%5d2.0.co;2
Black BA, Ruffner CM, Abrams MD (2006) Native American influences on the forest composition of the Allegheny Plateau, Northwest Pennsylvania. Can J For Res 36:1,266–1,275. https://doi.org/10.1139/x06-027
Bowman DMJS, Balch J, Artaxo P et al (2011) The human dimension of fire regimes on Earth. J Biogeogr 38:2,223–2,236. https://doi.org/10.1111/j.1365-2699.2011.02595.x
Brose PH, Dey DC, Guyette RP, Marshall JM, Stambaugh MC (2013) The influences of drought and humans on the fire regimes of northern Pennsylvania, USA. Can J For Res 43:757–767. https://doi.org/10.1139/cjfr-2012-0463
Brown H (2000) Wildland burning by American Indians in Virginia. Fire Manag Today 60:29–39
Cappon LJ (ed) (1976) Atlas of Early American History: The Revolutionary Era, 1760-90. Princeton University Press, Princeton
Clayton WW (1879) History of Steuben County. Lewis, Peck & Co., Philadelphia, New York
Cleveland SC (1873) History and Directory of Yates County, vol 1. S.C Cleveland, Penn Yan, NY
Core Team R (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Cronon W (1983) Changes in the Land: Indians, Colonists, and the Ecology of New England, 1st revised. Hill and Wang, New York
Dalal MK, Zaveri MA (2011) Automatic text classification: a technical review. Int J Comput Appl 28:37–40. https://doi.org/10.5120/3358-4633
Day GM (1953) The Indian as an ecological factor in the northeastern forest. Ecology 34:329–346. https://doi.org/10.2307/1930900
Denevan WM (1992) The pristine myth: the landscape of the Americas in 1492. Ann Assoc Am Geogr 82:369–385. https://doi.org/10.2307/2563351
Doty LR (1905) History of Livingston County, New York: from its earliest traditions to the present, together with Early Town Sketches. W. J Van Deusen, Jackson
Dwight T (1823) Travels in New-England and New-York. W. Baynes and Ogle, London
Elith J, Leathwick JR, Hastie T (2008) A working guide to boosted regression trees. J Anim Ecol 77:802–813. https://doi.org/10.1111/j.1365-2656.2008.01390.x
Engelbrecht W (2003) Iroquoia: The Development of a Native World. Syracuse University Press, Syracuse
Esri (2017) ArcGIS 10.5.1. Redlands, CA
Feinerer I, Hornik K (2018) tm: Text Mining Package. R package version 0.7-5. https://CRAN.R-project.org/package=tm
Feinerer I, Hornik K, Meyer D (2008) Text mining infrastructure in R. J Stat Softw 25:1–54
Fesenmyer KA, Christensen NL (2010) Reconstructing Holocene fire history in a southern Appalachian forest using soil charcoal. Ecology 91:662–670. https://doi.org/10.1890/09-0230.1
Filby PW (1985) A Bibliography of American County Histories. Genealogical Publishing Company, Baltimore
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:1,189–1,232
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38:367–378. https://doi.org/10.1016/s0167-9473(01)00065-2
Grumet RS (1995) Historic Contact: Indian People and Colonists in Today’s Northeastern United States in the Sixteenth through Eighteenth Centuries. University of Oklahoma Press, Norman
Harris ZS (1954) Distributional Structure. Word 10:146–162. https://doi.org/10.1080/00437956.1954.11659520
Hays J, Post CF (1999) Journey on the forbidden path: chronicles of a diplomatic mission to the Allegheny country, March-September, 1760. American Philosophical Society, Philadelphia
Hijmans RJ, Phillips S, Leathwick J, Elith J (2013) Dismo: species distribution modeling. R package version 0.8-17
Hill MJ, Hengchen S (2019) Quantifying the impact of dirty OCR on historical text analysis: eighteenth century collections online as a case study. Digital Scholarship in the Humanities fqz024. https://doi.org/10.1093/llc/fqz024
Internet Archive (2019) About the Internet Archive. https://archive.org/about/
Jennings F, Fenton WN (1995) The history and culture of Iroquois diplomacy: an interdisciplinary guide to the treaties of the six nations and their league. Syracuse University Press, Syracuse
Jones EE (2010) An analysis of factors influencing sixteenth and seventeenth century Haudenosaunee (Iroquois) settlement locations. J Anthropol Archaeol 29:1–14. https://doi.org/10.1016/j.jaa.2009.09.002
Keeley JE, Aplet GH, Christensen NL et al (2009) Ecological foundations for fire management in North American Forest and Shrubland Ecosystems. United States Department of Agriculture, United States Forest Service, Pacific Northwest Research Station, Portland
Keister M (1998) Rush Oak openings unit management plan. NYS Department of Environmental Conservation, Bath
Khan A, Baharudin B, Lee LH, Khan K (2010) A review of machine learning algorithms for text-documents classification. J Adv Inf Technol 1:4–20
Lewis HT, Anderson MK (2002) Introduction. In: Lewis HT, Anderson MK (eds) Forgotten fires: Native Americans and the Transient Wilderness. University of Oklahoma Press, Norman
Maude J (1826) Visit to the Falls of Niagara, in 1800. Longman, London
McEwan RW, Dyer JM, Pederson N (2011) Multiple interacting ecosystem drivers: toward an encompassing hypothesis of oak forest dynamics across eastern North America. Ecography 34:244–256. https://doi.org/10.1111/j.1600-0587.2010.06390.x
McIntosh WH (1878) History of Ontario County. Everts, Ensign, & Everts, Philadelphia, PA, New York
McKinstry ER (1997) Personal accounts of events, travels, and everyday life in America: an annotated bibliography. The Henry Francis du Pont Winterthur Museum Inc, Winterthur
Minard JS (1896) Allegany County and its people. A centennial memorial history of Allegany County, New York. W. A. Fergusson & Co., Alfred
Morgan LH (1901) League of the Ho-De’-No-Sau-Nee, or Iroquois. Dodd, Mead and Company, New York
Morton T (1883) New english Canaan. The Prince Society, Boston
Munoz SE, Gajewski K (2010) Distinguishing prehistoric human influence on late-Holocene forests in southern Ontario, Canada. Holocene 20:967–981. https://doi.org/10.1177/0959683610362815
Parker AC (1920) The Archeological History of New York. The University of the State of New York, Albany
Parshall T, Foster DR (2002) Fire on the New England Landscape: regional and temporal variation, cultural and environmental controls. J Biogeogr 29:1,305–1,317. https://doi.org/10.2307/827553
Pyne SJ (1982) Fire in America: a cultural history of wildland and rural fire. Princeton University Press, Princeton
Roberts MF (1891) Historical Gazetteer of Steuben County. New York. Millard F, Roberts, Syracuse, NY
RStudio Team (2016) RStudio: integrated development environment for R. RStudio Inc, Boston
Russell EWB (1983) Indian-set fires in the forests of the Northeastern United States. Ecology 64:78–88. https://doi.org/10.2307/1937331
Scherjon F, Bakels C, MacDonald K, Roebroeks W (2015) Burning the land: an ethnographic study of off-site fire use by current and historically documented foragers and implications for the interpretation of past fire practices in the landscape. Curr Anthropol 56:299–326. https://doi.org/10.1086/681561
Smith BD (2011) General patterns of niche construction and the management of “wild” plant and animal resources by small-scale pre-industrial societies. Philos Trans R Soc B-Biol Sci 366:836–848. https://doi.org/10.1098/rstb.2010.0253
Snow DR (1996) The Iroquois. Blackwell Publishers Inc, Malden
Stewart OC (2002) Forgotten fires: Native Americans and the transient wilderness. University of Oklahoma Press, Norman
Tulowiecki SJ (2018) Information retrieval in physical geography: a method to recover geographical information from digitized historical documents. Prog Phys Geogr: Earth Environ 42:369–390. https://doi.org/10.1177/0309133318770972
Tulowiecki SJ, Larsen CPS (2015) Native American impact on past forest composition inferred from species distribution models, Chautauqua County, New York. Ecol Monogr 85:557–581. https://doi.org/10.1890/14-2259.1
Tulowiecki SJ, Robertson DS, Larsen CPS (2019) Oak savannas in western New York State, circa 1795: synthesizing predictive spatial models and historical accounts to understand environmental and Native American influences. Ann Assoc Am Geogr. https://doi.org/10.1080/24694452.2019.1629871
Turner O (1849) Pioneer history of the Holland purchase of western New York: embracing some account of the ancient remains… and a history of pioneer settlement under the auspices of the Holland company; including reminiscences of the war of 1812; the origin, progress and completion of the Erie canal, etc., etc., etc. Jewett, Thomas & Co., Buffalo
Vale TR (2002) The pre-European landscape: pristine or humanized? In: Vale TR (ed) Fire, native peoples, and the natural landscape. Island Press, Washington, DC, pp 1–39
Waite DB (1883) O-Neh-Da Te-Car-Ne-O-Di or Up and Down the Hemlock. G.E. Colvin & G.P, Waite, Canadice, NY
Whitney GG (1996) From coastal wilderness to fruited plain: a history of environmental change in temperate North America, 1500 to the present. Cambridge University Press, New York
Wickham H, François R, Henry L, Müller K (2018) dplyr: A grammar of data manipulation. R package version 0.7.6. https://CRAN.R-project.org/package=dplyr
Williams GW (2000) Introduction to aboriginal fire use in North America. Fire Manag Today 60:8–12
Williams GW (2005) References on the American Indian use of fire in ecosystems. USDA Forest Service, Washington, DC
Yale University (2019) eHRAF World Cultures. In: Human Relations Area Files. http://hraf.yale.edu/products/ehraf-world-cultures/. Accessed 11 January 2019
Acknowledgements
This research received support from the National Science Foundation under Grant No. 1660388. The authors thank two anonymous reviewers for providing suggestions on an earlier version of this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by K. Brown.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tulowiecki, S.J., Williams, S.V. & Oldendorf, M.E. Discovering accounts of Native American burning within digitized historical documents using information retrieval methods. Veget Hist Archaeobot 29, 463–476 (2020). https://doi.org/10.1007/s00334-019-00753-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00334-019-00753-2