Data Mining Applied to the Electric Power Industry: Classification of Short-Circuit Faults in Transmission Lines

  • Yomara Pires
  • Jefferson Morais
  • Claudomir Cardoso
  • Aldebaro Klautau
Part of the Studies in Computational Intelligence book series (SCI, volume 169)


Data mining can play a fundamental role in modern power systems. However, the companies in this area still face several difficulties to benefit from data mining. A major problem is to extract useful information from the currently available non-labeled digitized time series. This work focuses on automatic classification of faults in transmission lines. These faults are responsible for the majority of the disturbances and cascading blackouts. To circumvent the current lack of labeled data, the Alternative Transients Program (ATP) simulator was used to create a public comprehensive labeled dataset. Results with different preprocessing (e.g., wavelets) and learning algorithms (e.g., decision trees and neural networks) are presented, which indicate that neural networks outperform the other methods.


Transmission Line Data Mining Technique Power Quality Frame Length Data Mining Apply 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bell, S., McArthur, S., McDonald, J., et al.: Model-based analysis of protection system performance. IEE Proc. Gener. Transm. Distrib. 145, 547–552 (1998)CrossRefGoogle Scholar
  2. 2.
    Zhang, L., Chen, S., Hu, Q.: Dynamic shape modeling of consumers’ daily load based on data mining. In: Advanced Data Mining and Applications (ADMA), pp. 712–719 (2005)Google Scholar
  3. 3.
    Kitayama, M., Matsubara, R., Izui, Y.: Application of data mining to customer profile analysis in the power electric industry. Power Engineering Society Winter Meeting 1, 632–634 (2002)Google Scholar
  4. 4.
    Bollen, M.H.J.: Understanding power quality problems: Voltage sags and interruptions. In: IEEE Press Series on Power Engineering (2000)Google Scholar
  5. 5.
    Santoso, S., Power Lamoree, J.D.: Power quality data analysis: From raw data to knowledge using knowledge discovery approach. In: Engineering Society Summer Meeting, pp. 172–177. IEEE, Los Alamitos (2000)Google Scholar
  6. 6.
    Vasilic, S.: Fuzzy Neural Network Pattern Recognition Algorithm for Classification of the Events in Power System Networks. PhD thesis, Texas A&M University (2004)Google Scholar
  7. 7.
    Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)zbMATHGoogle Scholar
  8. 8.
    Hastie, T., Tibshirani, R., Friedman, J.: The elements of statistical learning. Springer, Heidelberg (2001)zbMATHGoogle Scholar
  9. 9.
    Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, 2nd edn. Morgan Kaufmann, San Francisco (2005)Google Scholar
  10. 10.
    Luo, X., Kezunovic, M.: Fault analysis based on integration of digital relay and DFR. Power Engineering Society General Meeting 1, 746–751 (2005)Google Scholar
  11. 11.
    Clarke, G., Reynders, D.: Practical Modern SCADA Protocols: DNP3, 60870.5 and Related Systems. Newnes (2004)Google Scholar
  12. 12.
    Boyer, S.A.: SCADA: Supervisory Control and Data Acquisition. In: ISA-The Instrumentation, Systems, and Automation Society (1999)Google Scholar
  13. 13.
    Zhang, N., Kezunovic, M.: A real time fault analysis tool for monitoring operation of transmission line protective relay. Electric Power Systems Research 77, 361–370 (2007)CrossRefGoogle Scholar
  14. 14.
    EMTP. Alternative Transients Program (ATP) Rule Book. Canadian/American EMTP User’s Group (1995)Google Scholar
  15. 15.
    Pires, Y., Santos, A., Borges, J., Carvalho, A., Vinicius, M., Nunes, A., Santoso, S., Klautau, A.: A framework for evaluating data mining techniques applied to power quality. In: Brazilian Conference on Neural Networks, CBRN (2005)Google Scholar
  16. 16.
    Keogh, E., Kasetty, S.: On the need for time series data mining benchmarks: A survey and empirical demonstration. In: The 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining., pp. 102–111 (2002)Google Scholar
  17. 17.
    McEachern, A.: A floating-window algorithm for detecting certain power line faults that disrupt sensitive electronic loads. Instrumentation and Measurement, IEEE Trans. 39, 112–115 (1990)CrossRefGoogle Scholar
  18. 18.
    Barnett, V., Lewis, T.: Outliers in Statistical Data, 3rd edn. Wiley, Chichester (1994)zbMATHGoogle Scholar
  19. 19.
    Anderson, P.M.: Analysis of faulted Power Systems. IEEE Press, Los Alamitos (1995)Google Scholar
  20. 20.
    Vetterli, M., Kovačević, J.: Wavelets and Subband Coding. Prentice Hall, Englewood Cliffs (1995)zbMATHGoogle Scholar
  21. 21.
    Aguilera, C., Orduna, E., Rattá, G.: Fault detection, classification and faulted phase selection approach based on high-frequency voltage signals applied to a series-compensated line. IEEE Proc.- Gener. Transm. Distrib. 153, 469–475 (2006)CrossRefGoogle Scholar
  22. 22.
    Morais, J., Pires, Y., Cardoso, C., Klautau, A.: An experimental evaluation of automatic classification of sequences representing short circuits in transmission lines. In: Brazilian Conference on Neural Networks, CBRN (2007)Google Scholar
  23. 23.
    Li, M., Sleep, R.: A robust approach to sequence classification. In: International Conference on Tools with Artificial Intelligence, p. 5 (2005)Google Scholar
  24. 24.
    Rabiner, L.: A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257–286 (1989)CrossRefGoogle Scholar
  25. 25.
    Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. on ASSP 26(1), 43–49 (1978)zbMATHCrossRefGoogle Scholar
  26. 26.
    Morais, J., Pires, Y., Cardoso, C., Klautau, A.: Data mining applied to the electric power industry: Classification of short-circuit in transmission lines. In: IEEE International Conference on Intelligent Systems Design and Applications, ISDA (2007)Google Scholar
  27. 27.
    Oppenheim, A., Schafer, R.: Discrete-time Signal Processing. Prentice-Hall, Englewood Cliffs (1989)zbMATHGoogle Scholar
  28. 28.
  29. 29.
    Quinlan, J.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)Google Scholar
  30. 30.
    Rifkin, R., Klautau, A.: In defense of one-vs-all classification. Journal of Machine Learning Research 5, 101–141 (2004)MathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Yomara Pires
    • 1
  • Jefferson Morais
    • 1
  • Claudomir Cardoso
    • 1
  • Aldebaro Klautau
    • 1
  1. 1.Signal Processing Laboratory (LaPS)Federal University of Pará (UFPA)BelémBrazil

Personalised recommendations