Although we are constantly exposed to written text and human speech, most of the data we encounter in policy settings is available in neat, tabular formats. In the public sector, for example, both surveys and administrative data tend to rely on closed-form questions to quality control responses to a finite set of possibilities, such as yes/no and agree/neutral/disagree. Forcing respondents to a well-defined response makes for neat datasets, but ignores the richness of natural language that could be harvested from open-ended questions. Language carries a tremendous amount of information, encoded in nouns, verbs, adjectives, and other parts of speech. Combinations of these words allow language to carry multi-layered information, conveying sentiments, ideas, facts, etc.