A New Model for Classifying DNA Code Inspired by Neural Networks and FSA

Kang, Byeong; Kelarev, Andrei; Sale, Arthur; Williams, Ray

doi:10.1007/11961239_17

Byeong Kang²²,
Andrei Kelarev²²,
Arthur Sale²² &
…
Ray Williams²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4303))

Included in the following conference series:

Pacific Rim Knowledge Acquisition Workshop

464 Accesses
11 Citations

Abstract

This paper introduces a new model of classifiers CL(V,E,ℓ,r) designed for classifying DNA sequences and combining the flexibility of neural networks and the generality of finite state automata. Our careful and thorough verification demonstrates that the classifiers CL(V,E,ℓ,r) are general enough and will be capable of solving all classification tasks for any given DNA dataset. We develop a minimisation algorithm for these classifiers and include several open questions which could benefit from contributions of various researchers throughout the world.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baldi, P., Brunak, S.: Bioinformatics: The Machine Learning Approach. MIT Press, Cambridge (2001)
MATH Google Scholar
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms. The MIT Press, Cambridge (2001)
MATH Google Scholar
Dazeley, R.P., Kang, B.H.: Weighted MCRDR: deriving information about relationships between classifications. In: MCRDR, AI 2003: Advances in Artificial Intelligence, Perth, Australia, pp. 245–255 (2003)
Google Scholar
Dazeley, R.P., Kang, B.H.: An online classification and prediction hybrid system for knowledge discovery in databases. In: Proc. AISAT 2004, The 2nd Internat. Conf. Artificial Intelligence in Science and Technology, Hobart, Tasmania, pp. 114–119 (2004)
Google Scholar
Durbin, R., Eddy, S.R., Krogh, A., Mitchison, G.: Biological Sequence Analysis. Cambridge University Press, Cambridge (1999)
Google Scholar
Eilenberg, S.: Automata, Languages, and Machines, vol. A,B. Academic Press, New York (1974)
MATH Google Scholar
Gallian, J.A.: Graph labeling. Electronic J. Combinatorics, Dynamic Survey DS6, 148 (January 20, 2005), http://www.combinatorics.org
Gusfield, D.: Algorithms on Strings, Trees, and Sequences. In: Computer Science and Computational Biology, Cambridge University Press, Cambridge (1997)
Google Scholar
Holub, J., Iliopoulos, C.S., Melichar, B., Mouchard, L.: Distributed string matching using finite automata, Combinatorial Algorithms. In: AWOCA 1999, Perth, pp. 114–127 (1999)
Google Scholar
Jones, N.C., Pevzner, P.A.: An Introduction to Bioinformatics Algorithms. MIT Press, Cambridge (2004), http://www.bioalgorithms.info/
Google Scholar
Kang, B.H.: Pacific Knowledge Acquisition Workshop, Auckland, New Zealand (2004)
Google Scholar
Kelarev, A.V.: Ring Constructions and Applications. World Scientific, Singapore (2002)
MATH Google Scholar
Kelarev, A.V.: Graph Algebras and Automata. Marcel Dekker, New York (2003)
MATH Google Scholar
Kelarev, A.V., Miller, M., Sokratova, O.V.: Directed graphs and closure properties for languages. In: Baskoro, E.T. (ed.) Proc.12 Australasian Workshop on Combinatorial Algorithms, Putri Gunung Hotel, Lembang, Bandung, Indonesia, July 14–17, pp. 118–125 (2001)
Google Scholar
Kelarev, A.V., Miller, M., Sokratova, O.V.: Languages recognized by two-sided automata of graphs. Proc. Estonian Akademy of Science 54(1), 46–54 (2005)
MATH MathSciNet Google Scholar
Kelarev, A.V., Sokratova, O.V.: Languages recognized by a class of finite automata. Acta Cybernetica 15, 45–52 (2001)
MATH MathSciNet Google Scholar
Kelarev, A.V., Sokratova, O.V.: Directed graphs and syntactic algebras of tree languages. J. Automata, Languages & Combinatorics 6(3), 305–311 (2001)
MATH MathSciNet Google Scholar
Kelarev, A.V., Sokratova, O.V.: Two algorithms for languages recognized by graph algebras. Internat. J. Computer Math. 79(12), 1317–1327 (2002)
Article MATH MathSciNet Google Scholar
Kelarev, A.V., Sokratova, O.V.: On congruences of automata defined by directed graphs. Theoret. Computer Science 301, 31–43 (2003)
Article MATH MathSciNet Google Scholar
Kelarev, A.V., Trotter, P.G.: A combinatorial property of automata, languages and their syntactic monoids. In: Proceedings of the Internat. Conf. Words, Languages and Combinatorics III, Kyoto, Japan, pp. 228–239 (2003)
Google Scholar
Lee, K.H., Kay, J., Kang, B.H.: Keyword association network: a statistical multi-term indexing approach for document categorization. In: Proc. Fifth Australasian Document Computing Symposium, Brisbane, Australia, pp. 9–16 (2000)
Google Scholar
Lee, K., Kay, J., Kang, B.H.: KAN and RinSCut: lazy linear classifier and rank-in-score threshold in similarity-based text categorization. In: Proc. ICML-2002 Workshop on Text Learning, University of New South Wales, Sydney, Australia, pp. 36–43 (2002)
Google Scholar
Lee, K.H., Kay, J., Kang, B.H., Rosebrock, U.: A Comparative Study on Statistical Machine Learning Algorithms and Thresholding Strategies for Automatic Text Categorization. In: Ishizuka, M., Sattar, A. (eds.) PRICAI 2002. LNCS (LNAI), vol. 2417, pp. 444–453. Springer, Heidelberg (2002)
Chapter Google Scholar
Lee, K.H., Kang, B.H.: A new framework for uncertainty sampling: exploiting uncertain and positive-certain examples in similarity-based text classification. In: Proc. Internat. Conf. on Information Technology: Coding and Computing (ITCC 2004), Las Vegas, Nevada, p. 12 (2004)
Google Scholar
Luger, G.F.: Artificial Intelligence. Structures and Strategies for Complex Problem Solving. Addison-Wesley, Reading (2005)
Google Scholar
Mount, D.: Bioinformatics: Sequence and Genome Analysis, Cold Spring Harbor Laboratory (2001), http://www.bioinformaticsonline.org/
Park, S.S., Kim, Y., Park, G., Kang, B.H., Compton, P.: Automated information mediator for HTML and XML Based Web information delivery service. In: Proc. 18th Australian Joint Conf. on Artificial Intelligence, Sydney, pp. 401–404 (2005)
Google Scholar
Park, G.S., Kim, Y.S., Kang, B.H.: Synamic mobile content adaptation according to various delivery contexts. J. Security Engineering 2, 202–208 (2005)
Google Scholar
Park, G.S., Kim, Y.T., Kim, Y., Kang, B.H.: SOAP message processing performance enhancement by simplifying system architecture. J. Security Engineering 2, 163–170 (2005)
Google Scholar
Park, G.S., Park, S., Kim, Y., Kang, B.H.: Intelligent web document classification using incrementally changing training data Set. J. Security Engineering 2, 186–191 (2005)
Google Scholar
Păun, G., Salomaa, A.: New Trends in Formal Languages. Springer, Berlin (1997)
Google Scholar
Petrovskiy, M.: Probability Estimation in Error Correcting Output Coding Framework Using Game Theory. In: Zhang, S., Jarvis, R. (eds.) AI 2005. LNCS (LNAI), vol. 3809, pp. 186–196. Springer, Heidelberg (2005)
Chapter Google Scholar
Pin, J.E. (ed.): LITP 1988. LNCS, vol. 386. Springer, Heidelberg (1989)
MATH Google Scholar
Rozenberg, G., Salomaa, A.: Handbook of Formal Languages. In: Word, Language, Grammar, vol. 1, Springer, Berlin (1997)
Google Scholar
Smyth, B.: Computing Patterns in Strings. Addison-Wesley, Reading (2003)
Google Scholar
Tuga, M., Miller, M.: Δ-Optimum Exclusive Sum Labeling of Certain Graphs with Radius One. In: Akiyama, J., Baskoro, E.T., Kano, M. (eds.) IJCCGGT 2003. LNCS, vol. 3330, pp. 216–225. Springer, Heidelberg (2005)
Chapter Google Scholar
Sugeng, K.A., Miller, M., Slamin, Bača, M.: ( a, d)-Edge-Antimagic Total Labelings of Caterpillars. In: Akiyama, J., Baskoro, E.T., Kano, M. (eds.) IJCCGGT 2003. LNCS, vol. 3330, pp. 169–180. Springer, Heidelberg (2005)
Chapter Google Scholar
van Leeuwen, J.: Handbook of Theoretical Computer Science. In: Algorithms and Complexity, vol. A,B, Elsevier, Amsterdam (1990)
Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, University of Tasmania, Private Bag 100, Hobart, Tasmania, 7001, Australia
Byeong Kang, Andrei Kelarev, Arthur Sale & Ray Williams

Authors

Byeong Kang
View author publications
You can also search for this author in PubMed Google Scholar
Andrei Kelarev
View author publications
You can also search for this author in PubMed Google Scholar
Arthur Sale
View author publications
You can also search for this author in PubMed Google Scholar
Ray Williams
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science & Engineering, The University of New South Wales, Sydney, Australia
Achim Hoffmann
School of Computing, University of Tasmania, Sandy Bay, 7005, Tasmania, Australia
Byeong-ho Kang
Computing Department, Division of Information and Communication Sciences, Macquarie University, 2109, Sydney, NSW, Australia
Debbie Richards
Shimane University, 89-1 Enya-cho Izumo, 6938501, Shimane, Japan
Shusaku Tsumoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kang, B., Kelarev, A., Sale, A., Williams, R. (2006). A New Model for Classifying DNA Code Inspired by Neural Networks and FSA. In: Hoffmann, A., Kang, Bh., Richards, D., Tsumoto, S. (eds) Advances in Knowledge Acquisition and Management. PKAW 2006. Lecture Notes in Computer Science(), vol 4303. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11961239_17

Download citation

DOI: https://doi.org/10.1007/11961239_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68955-3
Online ISBN: 978-3-540-68957-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics