Skip to main content
Log in

A hybrid (visual/natural) controlled language

  • Original Paper
  • Published:
Language Resources and Evaluation Aims and scope Submit manuscript

Abstract

We define the notion of controlled hybrid language that allows information share and interaction between a controlled natural language (specified by a context-free grammar) and a controlled visual language (specified by a Symbol-Relation grammar). We present the controlled hybrid language INAUT, used to represent nautical charts of the French Naval and Hydrographic Service (SHOM) and their companion texts (Instructions nautiques).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

Notes

  1. Since \(\{a^1,a^2,b^2,b^1\}\) is a set, there is no intrinsic order among its elements. But there is nevertheless an order in the word 5 and, despite appearances, this order is not provided by the numeric superscripts (which can be arbitrary), but solely by the “next” relations of set \(\{{\rm next}(a^1,a^2),{\rm next}(a^2,b^2),{\rm next}(b^2,b^1)\}\).

  2. A regular closed set is the topological closure of the interior of some set.

  3. Notice that these are topological and not geometric arrangements. In other words: the fact that we have used circles to draw shapes in Fig. 3 is unimportant, a region can be any shape, even non-connected and with “holes”. All that counts is whether x and y have, or not, a nonempty intersection, whether this intersection is a region or not, and whether x is, or not, included in y and vice-versa. For example, both in \(\textsf {EC}\) and in \(\textsf {PO}\), the intersection of x and y is nonempty, the difference is that in \(\textsf {PO}\) the intersection \(x\cap y\) is a region, while in \(\textsf {EC}\), it is part of the boundary of x and of y, and hence is not a region.

  4. We use a sans-serif font (as in \(\textsf {NTPP}\)) for logical predicates, and a roman font (as in \({\rm NTPP}\)) for r-items in the SR grammar.

  5. Our notion of language hybridity differs from the one given in Costagliola et al. (1999), which deals with visual languages only. In Costagliola et al. (1999), a hybrid language is defined as a visual language using both logical (Plex syntactic model) and spatial relations (Box syntactic model).

  6. http://www.shom.fr/les-produits/produits-nautiques/ouvrages-nautiques/instructions-nautiques/.

  7. http://www.nauticalcharts.noaa.gov/nsd/cpdownload.htm.

  8. https://www.ukho.gov.uk/PRODUCTSANDSERVICES/PAPERPUBLICATIONS/Pages/NauticalPubs.aspx.

  9. With the exception of Seaspeak (Strevens and Johnson 1983), a controlled natural language defined in 1985 by the International Maritime Lecturers Association. In 2001 it evolved into SMCP (Standard Marine Communication Phrases IMO 2005) which is still used today. These controlled natural languages, dedicated to oral communication between ships, are “human-oriented” (of type c in Kuhn 2014, p. 125).

  10. We have omitted coord attributes and used name attribute values as node identifiers.

  11. http://www.iho.int/iho_pubs/IHO_Download.htm.

  12. Although, it is noteworthy to mention the approach of Freund et al. (1998) that uses a boosting algorithm, RankBoost, to learn to order a set of elements encoded in terms of a set of features depending on the learning corpus and which is used, inter alia, by Walker et al. (2001) to train a sentence planner.

  13. http://www.crunchbase.com/organization/ilastic.

  14. In PENS, the naturalness criterion refers to the relation of a controlled language to the natural language it is based upon. Even if we consider controlled visual languages as being based, in some sense, on more general “natural” visual languages (as discussed, for example, in Machin 2007), similarly to the way architectural (“formal”) blueprints are based on the architect’s primordial (“natural”) sketches, we consider that such a relation would be much more difficult and ambiguous to describe than in the case of textual languages, and hence is not necessarily appropriate for a general classification scheme.

  15. http://neo4j.com/.

  16. https://github.com/Toblerity/Shapely.

  17. https://www.mysql.com/.

  18. http://www.nltk.org/.

  19. http://www.dabeaz.com/ply/.

  20. https://github.com/terre-virtuelle/navisu.

  21. http://worldwind.arc.nasa.gov/java/.

  22. An ill-formed splitting would be a splitting not respecting the syntax, semantics, style and legacy conventions.

References

  • Angelov, K., & Ranta, A. (2009). Implementing controlled languages in GF. In Fuchs, N. (Ed.) Proceedings of CNL 2009, Springer lecture notes in artifical intelligence (Vol. 5972. pp. 82–101).

  • Barkmeyer, E., & Mattas, A. (2012). A restricted English for constructing ontologies (RECON). In NIST interagency/internal report 7868.

  • Beazley, D. Python Lex-Yacc. http://www.dabeaz.com/ply/.

  • Bertin, J. (1999). Sémiologie graphique. Paris: Éditions de l’EHSS.

    Google Scholar 

  • Camilleri, J.J., Paganelli, G., & Schneider, G. (2014). A CNL for contract-oriented diagrams. In Proceedings of CNL 2014, LNAI (Vol. 8625, pp. 135–146).

  • Chomsky, N. (1956). Three models for the description of language. IEEE Transactions on Information Theory, 2, 113–124.

    Article  Google Scholar 

  • Cohen, W. W., Schapire, R. E., & Singer, Y. (1999). Learning to order things. Journal of Artificial Intelligence Research, 10(1), 243–270.

    Google Scholar 

  • Costagliola, G., Ferrucci, F., Polese, G., & Vitiello, G. (1999). Supporting hybrid and hierarchical visual language definition. In IEEE symposium on visual languages (pp. 236–243).

  • Ferrucci, F., et al. (1996). Symbol-Relation grammars: A formalism for graphical languages. Information and Computation, 131, 1–46.

    Article  Google Scholar 

  • Freund, Y., Iyer, R. D., Schapire, R. E., & Singer, Y. (1998). An efficient boosting algorithm for combining preferences. In Proceedings of the fifteenth international conference on machine learning, ICML’98 (pp. 170–178). San Francisco, CA: Morgan Kaufmann Publishers Inc.

  • Grant Head, C. (1991). Mapping as language or semiotic system: Review and comment. In D. M. Mark & A. U. Frank (Eds.), Cognitive and linguistic aspects of geographic space (pp. 237–262). berlin: Kluwer Academic Publishers.

    Chapter  Google Scholar 

  • Haralambous, Y., Sauvage-Vincent, J., & Puentes, J. (2014). INAUT, a controlled language for the French coast pilot books instructions nautiques. In Proceedings of CNL 2014, LNAI (Vol. 8625, pp. 102–111).

  • Ilastic. http://www.crunchbase.com/organization/ilastic.

  • IMO. (2005). IMO standard marine communication phrases, with pronunciation. International Maritime Organization.

  • International Hydrographic Association: Special publication S-57. http://www.iho.int/iho_pubs/IHO_Download.htm.

  • Kerpedjiev, S.M. (1992). Automatic generation of multimodal weather reports from datasets. In Proceedings of the third international conference on applied natural language processing (pp. 48–55).

  • Knuth, D. E. (1968). Semantics of context-free languages. Mathematical Systems Theory, 2, 127–145.

    Article  Google Scholar 

  • Kuhn, T. (2014). A survey and classification of controlled natural languages. Computational Linguistics, 40, 121–170.

    Article  Google Scholar 

  • Levenshtein, V. I. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10(8), 707–710.

    Google Scholar 

  • Li, Z. (1995). Maps as a visual language: A Chinese perspective. Ph.D. thesis, Wilfrid Laurier University.

  • Machin, D. (2007). Introduction to multimodal analysis. London: Bloomsbury Academic.

    Google Scholar 

  • Mann, W. C., & Thompson, S. A. (1988). Rhetorical structure theory: Towards a functional theory of text organization. Text, 8(3), 243–281.

    Google Scholar 

  • Marriott, K., Meyer, B., & Wittenburg, K.B. (1998). A survey of visual language specification and recognition. In: Visual language theory (pp. 5–85). Springer.

  • Menanteau, B. (Ed.) (2011). Édition des Instructions nautiques (procédure spécifique). SHOM.

  • Menanteau, B. (Ed.) (2013). Définition des Instructions nautiques du SHOM (norme). SHOM.

  • Morvan, S., Marques, D., Mens, J., Pensec, T., & Tritschler, C. NaVisu project. https://github.com/terre-virtuelle/navisu.

  • MySQL AB: MySQL. https://www.mysql.com/.

  • Nagi, K. (2013). A new representation of WordNet using graph databases. International Journal on Advances in Software, 6, 298–308.

  • NASA: WorldWind API. http://worldwind.arc.nasa.gov/java/.

  • Natural Language ToolKit. http://www.nltk.org/.

  • Nečas, I. (2011). BDD as a specification and QA instrument. Master’s thesis, Masaryk University.

  • Neo4j project. http://neo4j.com/.

  • NOAA: United States coast pilot. http://www.nauticalcharts.noaa.gov/nsd/cpdownload.htm.

  • Pool, J. (2006). Can controlled languages scale to the Web? In Proceedings of the 5th international workshop on controlled language applications (pp. 1–12).

  • Pratt-Hartmann, I. (2013). Twenty years of topological logic. In M. Raubal, D. M. Mark, & A. U. Frank (Eds.), Cognitive and linguistic aspects of geographic space, lecture notes in geoinformation and cartography (pp. 217–236). Berlin: Springer.

    Chapter  Google Scholar 

  • Randell, D.A., Cui, Z., & Cohn, A.G. (1992). A spatial logic based on regions and connection. In Proceedings of the 2nd international conference on knowledge representation and reasoning (pp. 165–176). Morgan Kaufmann.

  • Ranta, A. (2009). Grammatical framework: A multilingual grammar formalism. Language and Linguistics Compass, 3, 1242–1265.

    Article  Google Scholar 

  • Reiter, E., & Dale, R. (2000). Building natural language generation systems. Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • Roth, M., & Frank, A. (2010). Computing EM-based alignments of routes and route directions as a basis for natural language generation. In Proceedings of the 23rd international conference on computational linguistics, (pp. 958–966). Association for Computational Linguistics.

  • Sauvage-Vincent, J., Haralambous, Y., & Puentes, J. (2015). Sentence ordering in electronic navigational chart companion text generation. In ENLG 2015: 15th European Workshop on natural language generation (pp. 66–70). Stroudsburg, PA: The Association for Computational Linguistics.

  • Schlichtmann, H. (1984). Characteristic traits of the semiotic system “map symbolism”. The Cartographic Journal, 22, 23–30.

    Article  Google Scholar 

  • Schwitter, R. (2002). English as a formal specification language. In: Proceedings of DEXA ’02 (pp. 228–232).

  • Shapely Python package. https://github.com/Toblerity/Shapely.

  • SHOM: Instructions nautiques. http://www.shom.fr/les-produits/produits-nautiques/ouvrages-nautiques/instructions-nautiques/.

  • SHOM. (2012). Symboles, abréviations et termes utilisés sur les cartes marines, Vol. INT 1, 5th edn.

  • Strevens, P., & Johnson, E. (1983). SEASPEAK: A project in applied linguistics, language engineering, and eventually ESP for sailors. The ESP Journal, 2(2), 123–129.

    Article  Google Scholar 

  • UKHO: Admiralty sailing directions. https://www.ukho.gov.uk/PRODUCTSANDSERVICES/PAPERPUBLICATIONS/Pages/NauticalPubs.aspx.

  • Varanka, D. (1991). An approach to map/text interrelationships. In D. M. Mark & A. U. Frank (Eds.), Cognitive and linguistic aspects of geographic space (pp. 285–294). Berlin: Kluwer Academic Publishers.

    Chapter  Google Scholar 

  • Walker, M.A., Rambow, O., & Rogati, M. (2001). Spot: A trainable sentence planner. In Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies (pp. 1–8). Association for Computational Linguistics.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yannis Haralambous.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Haralambous, Y., Sauvage-Vincent, J. & Puentes, J. A hybrid (visual/natural) controlled language. Lang Resources & Evaluation 51, 93–129 (2017). https://doi.org/10.1007/s10579-016-9339-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10579-016-9339-6

Keywords

Navigation