Approximate Location of Relevant Variables under the Crossover Distribution

Damaschke, Peter

doi:10.1007/3-540-45322-9_13

Peter Damaschke⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2264))

Included in the following conference series:

International Symposium on Stochastic Algorithms

377 Accesses

Abstract

Searching for genes involved in traits (e.g. diseases), based on genetic data, is considered from a computational learning perspective. This leads to the problem of learning relevant variables of functions from data sampled from a certain class of distributions generalizing the uniform distribution. The Fourier transform of Boolean functions is applied to translate the problem into searching for local extrema of certain functions of observables. We work out the combinatorial structure of this approach and illustrate its potential use.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

D.A. Bell, H. Wang: A formalism for relevance and its application in feature subset selection, Machine Learning 41 (2000), 175–195
Article MATH Google Scholar
A. Bernasconi: Mathematical techniques for the analysis of Boolean functions, PhD thesis, Univ. Pisa 1998
Google Scholar
N. Bshouty, J.C. Jackson, C. Tamon: More efficient PAC-learning of DNF with membership queries under the uniform distribution, ACM Symp. on Computational Learning Theory COLT’99, 286–293
Google Scholar
P. Damaschke: Adaptive versus nonadaptive attribute-efficient learning, Machine Learning 41 (2000), 197–215
Article MATH Google Scholar
P. Damaschke: Parallel attribute-efficient learning of monotone Boolean functions, 7th Scand. Workshop on Algorithm Theory SWAT’2000, LNCS 1851, 504–512, journal version accepted for J. of Computer and System Sciences
Google Scholar
A.S. Goldstein, E.M. Reingold: A Fibonacci version of Kraft’s inequality with an application to discrete unimodal search, SIAM J. Computing 22 (1993), 751–777
Article MATH MathSciNet Google Scholar
J.C. Jackson: An efficient membership-query algorithm for learning DNF with respect to the uniform distribution, J. of Comp. and Sys. Sci. 55 (1997), 414–440
Article MATH Google Scholar
G.H. John, R. Kohavi, K. Pfleger: Irrelevant features and the subset selection problem, 11th Int. Conf. on Machine Learning 1994, Morgan Kaufmann, 121–129
Google Scholar
D.S. Johnson (ed.): Challenges for Theoretical Computer Science (draft), available at http://www.research.att.com/~dsj/nflist.html#Biology
S. Karlin, U. Liberman: Classifications and comparisons of multilocus recombination distribution, Proc. Nat. Acad. Sci. USA 75 (1979), 6332–6336
Google Scholar
M.J. Kearns, R.E. Schapire: Efficient distribution-free learning of probabilistic concepts, in: Computational Learning Theory and Natural Learning Systems, MIT Press 1994, 289–329 (preliminary version in FOCS’90)
Google Scholar
R. Kohavi: Feature subset selection as search with probabilistic estimates, in: R. Greiner, D. Subramanian (eds.): Relevance, Proc. 1994 AAAI Fall Symposium, 122–126
Google Scholar
W. Li, J. Reich: A complete enumeration and classification of two-locus disease models, Human Hereditary (1999)
Google Scholar
N. Linial, Y. Mansour, N. Nisan: Constant depth circuits, Fourier transform, and learnability, J. of ACM 40 (1993), 607–620
Article MATH MathSciNet Google Scholar
Y. Mansour: Learning Boolean functions via the Fourier transform, in: Theoretical Advances in Neural Computing and Learning, Kluwer 1994
Google Scholar
A. Mathur, E.M. Reingold: Generalized Kraft’s inequality and discrete k-modal search, SIAM J. Computing 25 (1996), 420–447
Article MATH MathSciNet Google Scholar
J.C. Schlimmer: Efficiently inducing determinations: a complete and systematic search algorithm that uses optimal pruning, 10th Int. Conf. on Machine Learning 1993, Morgan Kaufmann, 284–290
Google Scholar
J.D. Terwilliger, H.H.H. Göring: Gene mapping in the 20th and 21st centuries: statistical methods, data analysis, and experimental design, Human Biology 72 (2000), 63–132
Google Scholar

Download references

Author information

Authors and Affiliations

Mathematical and Computing Sciences, Chalmers University, 41296, Göteborg, Sweden
Peter Damaschke

Authors

Peter Damaschke
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Computer Architecture and Software Engineering, GMD - National ResearchCenter for Information Technology, Kekuléstr.7, 12489, Berlin-Adlershof, Germany
Kathleen Steinhöfel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Damaschke, P. (2001). Approximate Location of Relevant Variables under the Crossover Distribution. In: Steinhöfel, K. (eds) Stochastic Algorithms: Foundations and Applications. SAGA 2001. Lecture Notes in Computer Science, vol 2264. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45322-9_13

Download citation

DOI: https://doi.org/10.1007/3-540-45322-9_13
Published: 08 February 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43025-4
Online ISBN: 978-3-540-45322-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics