A semiparametric generative model for efficient structured-output supervised learning

Costa, Fabrizio; Passerini, Andrea; Lippi, Marco; Frasconi, Paolo

doi:10.1007/s10472-009-9137-6

Published: 05 May 2009

Annals of Mathematics and Artificial Intelligence, Volume 54, pages 207–222 (2008)

Abstract

We present a semiparametric generative model for supervised learning with structured outputs. The main algorithmic idea is to replace the parameters of an underlying generative model (such as a stochastic grammar) with input-dependent predictions obtained by (kernel) logistic regression. This method avoids the computational burden of comparing target and predicted structures during training, but requires a vector of sufficient statistics for each training example as an additional input. The resulting training algorithm is asymptotically more efficient than structured-output SVM as the size of the output structure grows. At the same time, because the parameters of a joint distribution are computed as a function of the full input structure, typical expressiveness limitations of related conditional models (such as maximum entropy Markov models) can potentially be avoided. Empirical results on artificial and real data (in the domains of natural language parsing and RNA secondary structure prediction) show that the method works well in practice and scales up with the size of the output structures.
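
The core idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes a toy generative model with a single nonterminal and two production rules, uses plain linear (not kernel) multinomial logistic regression, and all names (`rule_probs`, `fit`, the toy data) are hypothetical. The sufficient statistics are taken to be per-example rule usage counts, and training maximises the count-weighted log-likelihood without ever comparing predicted and target structures.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def rule_probs(W, x):
    # Input-dependent probabilities of the rules of one nonterminal:
    # these replace the fixed parameters of the generative model.
    return softmax([sum(w * xi for w, xi in zip(row, x)) for row in W])

def fit(X, counts, n_rules, lr=0.1, epochs=500):
    # Gradient ascent on sum_i sum_k counts[i][k] * log p_k(x_i),
    # where counts[i] is the (assumed) sufficient-statistics vector
    # of training example i.
    dim = len(X[0])
    W = [[0.0] * dim for _ in range(n_rules)]
    for _ in range(epochs):
        grad = [[0.0] * dim for _ in range(n_rules)]
        for x, n in zip(X, counts):
            p = rule_probs(W, x)
            total = sum(n)
            for k in range(n_rules):
                err = n[k] - total * p[k]  # observed minus expected count
                for d in range(dim):
                    grad[k][d] += err * x[d]
        for k in range(n_rules):
            for d in range(dim):
                W[k][d] += lr * grad[k][d] / len(X)
    return W

# Toy data (illustrative only): a bias term plus one input feature.
X = [[1.0, 0.0], [1.0, 1.0]]
# Rule usage counts in the target structure of each example.
counts = [[8, 2], [1, 9]]

W = fit(X, counts, n_rules=2)
p0 = rule_probs(W, X[0])
p1 = rule_probs(W, X[1])
print(p0, p1)
```

In this sketch the cost of one training pass depends on the number of rules and input features, not on a search over output structures, which is the kind of saving the abstract attributes to working from sufficient statistics.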

Author information

Fabrizio Costa
Present address: Departement Computerwetenschappen, Katholieke Universiteit Leuven, Celestijnenlaan 200 A, 3001, Heverlee, Belgium
Andrea Passerini
Present address: Dipartimento di Ingegneria e Scienza dell’Informazione, Università degli Studi di Trento, Sommarive st. 14, POVO, 38100, Trento, Italy

Authors and Affiliations

Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Via di Santa Marta 3, 50139, Firenze, Italy
Fabrizio Costa, Andrea Passerini, Marco Lippi & Paolo Frasconi

Corresponding author

Correspondence to Marco Lippi.

About this article

Cite this article

Costa, F., Passerini, A., Lippi, M. et al. A semiparametric generative model for efficient structured-output supervised learning. Ann Math Artif Intell 54, 207–222 (2008). https://doi.org/10.1007/s10472-009-9137-6

Received: 19 March 2009
Accepted: 19 March 2009
Published: 05 May 2009
Issue Date: November 2008
DOI: https://doi.org/10.1007/s10472-009-9137-6

Mathematics Subject Classification (2000)

68T05
