Abstract
This paper describes an approach to case-based reasoning by which the case base is enriched at reasoning time. Enrichment results from the local application of variations to seed cases: new hypothetical cases are created which get closer and closer to the target problem. The creation of these hypothetical cases is based on structures associated to the problem and solution spaces, called variation spaces, that enable defining a language of adaptation rules. Ultimately reaching the target problem (exactly or nearly) allows the system to deliver a solution. Application of the proposed approach to machine translation shows behind state-of-the-art, but promising results.
The authors want to thank the anonymous reviewers for their detailed comments. They have tried to do their best to take them into account. The first author is supported by JSPS Grant-In-Aid 18K11447: “Self-explainable and fast-to-train example-based machine translation using neural networks”.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Some authors consider that analogy requires additional postulates (Lepage [10]). However, only these three postulates are used in this paper.
- 2.
When conformity is an equivalence relation, reflexivity and symmetry are straightforward. But conformity is not necessarily an equivalence relation.
- 3.
The LCS (“longest common subsequence”) distance is an edit distance based on the character insertion and character deletion edit operations, with a cost of 1 for both. In other terms, if P and Q are two strings and L is the LCS of P and Q, then \(\texttt {{dist}}(P, Q)=(|P|-|L|) + (|Q|-|L|)\).
- 4.
If uncertainty is modeled thanks to a probability measure, it is possible to associate to a probability \(P\in [0;1]\) a penalty \(\pi ={-}\log {}P\in [0;\infty ]\). If uncertainty is thought of as a measure of the gap to consistency with the real world, it is possible to associate a distance to it. Then, by definition, licit cases have a penalty of 0. The representation of uncertainty by penalties is chosen in this paper for generality of expression.
- 5.
Once again, costs could be associated to probabilities: \(\texttt {{cost}}(\texttt {{hypo}})\) could be defined by \({-}\log P(\varphi ^2~|~\varphi ^1)\), where \(P(\varphi ^2~|~\varphi ^1)\) is the probability of \(\varphi ^2\) being true given that \(\varphi ^1\) is. But they can also be associated to distance. \(\texttt {{cost}}(\texttt {{hypo}})\) could be defined as \(\texttt {{dist}}(\varphi ^1, \varphi ^2)\) which expresses the additional uncertainty on \(\varphi ^2\) when inferred from \(\varphi ^1\).
- 6.
\((\varDelta {\mathcal {P}}, {+})\) being a commutative group means that \(\varDelta {\mathcal {P}}\) is a set, that \(+\) is an associative and commutative operation on \(\varDelta {\mathcal {P}}\), and that every \(\overrightarrow{u}\in \varDelta {\mathcal {P}}\) has an inverse element \({-}\overrightarrow{u}\) (meaning \(\overrightarrow{u}+({-}\overrightarrow{u})=\overrightarrow{0}\)).
- 7.
A multiset is denoted with double braces; for example \(M=\left\{ \!\!\left\{ a, a, b, c, c, c\right\} \!\!\right\} \) contains a with multiplicity 2, b with multiplicity 1 and c with multiplicity 3. Thus the cardinality of M is \(\left| M\right| =2+1+3=6\).
- 8.
Note that this is the equality of two ratios. Of course, it is also an analogy by itself ( ), but this is not what is meant here.
- 9.
- 10.
https://lepage-lab.ips.waseda.ac.jp/ > Kakenhi 15K00317 > Tools.
- 11.
Remember that LCS distance is used: dist(très, trop) = 4 (two deletions and two insertions), not 2 (two substitutions) as would be the case with Levenshtein distance. .
- 12.
- 13.
References
Aamodt, A., Plaza, E.: Case-based reasoning: foundational issues, methodological variations, and system approaches. AI Commun. 7(1), 39–59 (1994)
Couceiro, M., Hug, N., Prade, H., Richard, G.: Analogy-preserving functions: a way to extend Boolean samples. In: IJCAI 2017, pp. 1575–1581 (2017)
Craw, S., Wiratunga, N., Rowe, R.C.: Learning adaptation knowledge to improve case-based reasoning. Artif. Intell. 170(16–17), 1175–1192 (2006)
d’Aquin, M., Badra, F., Lafrogne, S., Lieber, J., Napoli, A., Szathmary, L.: Case base mining for adaptation knowledge acquisition. In: IJCAI 2007, pp. 750–755 (2007)
Fam, R., Lepage, Y.: Tools for the production of analogical grids and a resource of n-gram analogical grids in 11 languages. In: Proceedings of LREC 2018, pp. 1060–1066. ELRA, May 2018
Hanney, K., Keane, M.T.: Learning adaptation rules from a case-base. In: Smith, I., Faltings, B. (eds.) EWCBR 1996. LNCS, vol. 1168, pp. 179–192. Springer, Heidelberg (1996). https://doi.org/10.1007/BFb0020610
Hussain, Z., Gimenez, F., Yi, D., Rubin, D.: Differential data augmentation techniques for medical imaging classification tasks. In: Annual Symposium Proceedings, pp. 979–984 (2017)
Jalali, V., Leake, D., Forouzandehmehr, N.: Learning and applying adaptation rules for categorical features: an ensemble approach. AI Commun. 30(3–4), 193–205 (2017)
Leake, D., Schack, B.: Exploration vs. Exploitation in case-base maintenance: leveraging competence-based deletion with ghost cases. In: Cox, M.T., Funk, P., Begum, S. (eds.) ICCBR 2018. LNCS (LNAI), vol. 11156, pp. 202–218. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01081-2_14
Lepage, Y.: Proportional analogy in written language data. In: Gala, N., Rapp, R., Bel-Enguix, G. (eds.) Language Production, Cognition, and the Lexicon. TSLT, vol. 48, pp. 151–173. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-08043-7_10
Lepage, Y.: Semantico-formal resolution of analogies between sentences. In: Proceedings of LTC 2019, pp. 57–61 (2019)
Lepage, Y., Denoual, E.: Purest ever example-based machine translation: detailed presentation and assessment. Mach. Transl. 19, 251–282 (2005)
Lepage, Y., Lieber, J.: Case-based translation: first steps from a knowledge-light approach based on analogy to a knowledge-intensive one. In: Cox, M.T., Funk, P., Begum, S. (eds.) ICCBR 2018. LNCS (LNAI), vol. 11156, pp. 563–579. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01081-2_37
Lieber, J., Nauer, E., Prade, H., Richard, G.: Making the best of cases by approximation, interpolation and extrapolation. In: Cox, M.T., Funk, P., Begum, S. (eds.) ICCBR 2018. LNCS (LNAI), vol. 11156, pp. 580–596. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01081-2_38
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: a method for automatic evaluation of machine translation. Technical report, IBM (2001)
Richter, M.M., Weber, R.: Case-Based Reasoning: A Textbook. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40167-1
Taylor, L., Nitschke, G.: Improving deep learning using generic data augmentation. CoRR (2017)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Lepage, Y., Lieber, J. (2019). An Approach to Case-Based Reasoning Based on Local Enrichment of the Case Base. In: Bach, K., Marling, C. (eds) Case-Based Reasoning Research and Development. ICCBR 2019. Lecture Notes in Computer Science(), vol 11680. Springer, Cham. https://doi.org/10.1007/978-3-030-29249-2_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-29249-2_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29248-5
Online ISBN: 978-3-030-29249-2
eBook Packages: Computer ScienceComputer Science (R0)