Skip to main content
Log in

NestDE: generic parameters tuning for automatic story segmentation

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Parameters tuning is a crucial task in automatic story segmentation. For most previous story segmentation methods, however, the parameters were simply derived from empirical tuning, which may indeed harm the fairness of the evaluation, or even misguide the conclusion. In this paper, we present a general parameters tuning approach, namely nested differential evolution. As a practical general-purpose parameters tuner, our approach itself is parameters-robust and is generic enough to optimize the most usual types of parameters for the given corpus and evaluation criterion. Besides, our approach is able to cooperate with empirical tuning and jointly produce better parameters based on the prior knowledge of experienced users. Extensive experiments on synthetic challenging quadratic pseudo-Boolean optimization and real-world story segmentation tasks validate the superior performance of our approach over traditional empirical tuning and other generic optimizers, such as simulated annealing and classical differential evolution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. Parameters tuning is not only important in story segmentation, in low-level vision (Feng and Liu 2008; Feng et al. 2010, 2013), but parameters are also crucial for an algorithm to obtain good performance.

  2. To facilitate parameters tuning, besides raw data, the tuning corpus \(\mathbb {C}\) should also contain ground truth labeling.

References

  • Brest J, Greiner S, Boskovic B, Mernik M, Zumer V (2006) Self-adapting control parameters in differential evolution: a comparative study on numerical benchmark problems. IEEE Trans Evol Comput 10(6):646–657

    Article  Google Scholar 

  • CCTV Corpus (2010) Story segmentation and topic detection of CCTV Mandarin broadcast news

  • Choi FYY, Wiemer-Hastings P, Moore J (2001) Latent semantic analysis for text segmentation. In: Proceedings of EMNLP, pp 109–117

  • Das S, Suganthan PN (2011) Differential evolution: a survey of the state-of-the-art. IEEE Trans Evol Comput 15(1):4–31

    Article  Google Scholar 

  • Eiben AE, Smit SK (2011) Parameter tuning for configuring and analyzing evolutionary algorithms. Swarm Evol Comput 1(1):19–31

    Article  Google Scholar 

  • Feng W, Liu ZQ (2008) Region-level image authentication using Bayesian structural content abstraction. IEEE Trans Image Process 17(12):2413–2424

    Article  MathSciNet  Google Scholar 

  • Feng W, Jia J, Liu ZQ (2010) Self-validated labeling of Markov random fields for image segmentation. IEEE Trans Pattern Anal Mach Intell 32(10):1871–1887

    Article  Google Scholar 

  • Feng W, Nie X, Wan L, Xie L, Jiang J (2012) Lexical story co-segmentation of Chinese broadcast news. In: Proceedings of INTERSPEECH, pp 2283–2286

  • Feng W, Liu ZQ, Wan L, Pun CM, Jiang J (2013) A spectral-multiplicity-tolerant approach to robust graph matching. Pattern Recognit 46(10):2819–2829

    Article  Google Scholar 

  • Hearst MA (1997) Texttiling: segmenting text into multi-paragraph subtopic passages. Comput Linguist 23(1):33–64

    Google Scholar 

  • Kolmogorov V, Rother C (2007) Minimizing nonsubmodular functions with graph cuts: a review. IEEE Trans Pattern Anal Mach Intell 29(7):1274–1279

    Article  Google Scholar 

  • Lampinen J, Zelinka I (1999a) Mixed integer-discrete-continuous optimization by differential evolution, part 1: the optimization method. In: Proceedings of international mendel conference on soft computing, pp 71–76

  • Lampinen J, Zelinka I (1999b) Mixed integer-discrete-continuous optimization by differential evolution, part 2: a practical example. In: Proceedings of international mendel conference on soft computing, pp 77–81

  • Li L, Feng W, Wan L, Zhang J (2013) Maximum cohesive grid of superpixels for fast object localization. In: Proceedings of CVPR, pp 3174–3181

  • Liu J, Lampinen J (2002) On setting the control parameter of the differential evolution method. In: Proceedings of international conference on soft computing, pp 11–18

  • Liu Z, Xie L, Feng W (2010) Maximum lexical cohesion for fine-grained news story segmentation. In: Proceedings of INTERSPEECH, ISCA, pp 1301–1304

  • Morris J, Hirst G (1991) Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Comput Linguist 17(1):21–48

    Google Scholar 

  • Nie X, Feng W, Wan L, Xie L (2013) Measuring semantic similarity by contextual word connections in Chinese news story segmentation. In: Proceedings of ICASSP, pp 8312–8316

  • Price KV, Storn RN, Lampinen JA (2005) Differential evolution: a practical approach to global optimization. Springer, New York

  • Qin AK, Huang VL, Suganthan PN (2009) Differential evolution algorithm with strategy adaptation for global numerical optimization. IEEE Trans Evolut Comput 13(2):398–417

    Article  Google Scholar 

  • Rother C, Kohli P, Feng W, Jia J (2009) Minimizing sparse higher order energy functions of discrete variables. In: Proceedings of CVPR, pp 1382–1389

  • Smit SK, Eiben AE (2009) Comparing parameter tuning methods for evolutionary algorithms. In: Proceedings of IEEE congress on evolutionary computation, pp 399–406

  • Smit SK, Eiben AE (2010) Parameter tuning of evolutionary algorithms: generalist vs.specialist. In: Applications of evolutionary computation. LNCS, vol 6024. Springer, New York, pp 542–551

  • TDT2 Corpus (2000) Topic detection and tracking phase 2. http://projects.ldc.upenn.edu/TDT2/

  • Xie L, Zheng L, Liu Z, Zhang Y (2012) Laplacian eigenmaps for automatic story segmentation of broadcast news. IEEE Trans Audio Speech Lang Process 20(1):276–289

    Article  Google Scholar 

  • Zhang J, Xie L, Feng W, Zhang Y (2009) A subword normalized cut approach to automatic story segmentation of Chinese broadcast. Aisa Inf Retreival Symp LNCS 5839:136–148

    Google Scholar 

Download references

Acknowledgments

This work is supported by the Program for New Century Excellent Talents in University (NCET-11-0365), National Nature Science Foundation of China (61100121, 61175018), and National Science and Technology Support Project (2013BAK01B01).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Feng.

Additional information

Communicated by L. Xie.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Feng, W., Yin, X., Zhang, Y. et al. NestDE: generic parameters tuning for automatic story segmentation. Soft Comput 19, 61–70 (2015). https://doi.org/10.1007/s00500-014-1450-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-014-1450-2

Keywords

Navigation