Skip to main content

Relaxed Optimisation for Tensor Principal Component Analysis and Applications to Recognition, Compression and Retrieval of Volumetric Shapes

  • Conference paper
  • First Online:
Imaging, Vision and Learning Based on Optimization and PDEs (IVLOPDE 2016)

Part of the book series: Mathematics and Visualization ((MATHVISUAL))

  • 458 Accesses

Abstract

The mathematical and computational backgrounds of pattern recognition are the geometries in Hilbert space used for functional analysis and the applied linear algebra used for numerical analysis, respectively. Organs, cells and microstructures in cells dealt with in biomedical image analysis are volumetric data. We are required to process and analyse these data as volumetric data without embedding into higher-dimensional vector spaces from the viewpoint of object-oriented data analysis . Therefore, sampled values of volumetric data are expressed as three-way array data. The aim of the paper is to develop relaxed closed forms for tensor principal component analysis (PCA) for the recognition , classification , compression and retrieval of volumetric data. Tensor PCA derives the tensor Karhunen-Loève transform, which compresses volumetric data, such as organs, cells in organs and microstructures in cells, preserving both the geometric and statistical properties of objects and spatial textures in the space.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The project “Computational Anatomy for Computer-aided Diagnosis and Therapy: Frontiers of Medical Image Sciences” funded by Grant-in-Aid for Scientific Research on Innovative Areas, MEXT, Japan. http://www.comp-anatomy.org/wiki/index.php?Computational.

References

  1. A. Andreopoulos, J.K. Tsotsos, Efficient and generalizable statistical models of shape and appearance for analysis of cardiac MRI. Med. Image Anal. 12, 335–357 (2008)

    Article  Google Scholar 

  2. B. Aubert-Broche, M. Griffin, G.B. Pike, A.C. Evans, D.L. Collins, 20 new digital brain phantoms for creation of validation image data bases. IEEE Trans. Med. Imaging 25, 1410–1416 (2006)

    Article  Google Scholar 

  3. A. Bronstein, M. Bronstein, R. Kimmel, Numerical Geometry of Non-Rigid Shapes (Springer, Berlin, 2009)

    Book  Google Scholar 

  4. A. Cichocki, R. Zdunek, A.-H. Phan, S. Amari, Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation (Wiley, Hoboken, 2009)

    Book  Google Scholar 

  5. R. Davies, C. Twining, C. Taylor, Statistical Models of Shape Optimisation and Evaluation (Springer, Berlin, 2008)

    MATH  Google Scholar 

  6. U. Grenander, M. Miller, Pattern Theory: From Representation to Inference (OUP, New York, 2007)

    MATH  Google Scholar 

  7. M. Hamidi, J. Pearl, Comparison of the cosine Fourier transform of Markov-1 signals. IEEE ASSP 24, 428–429 (1976)

    Article  MathSciNet  Google Scholar 

  8. H. Itoh, A. Imiya, T. Sakai, Mathematical aspects of tensor subspace method, in Structural, Syntactic, and Statistical Pattern Recognition. Lecture Notes in Computer Science, vol. 10029 (Springer, Cham, 2016), pp. 37–48

    Google Scholar 

  9. T. Iijima, Pattern Recognition (Corona-sha, Tokyo, 1974) (in Japanese)

    Google Scholar 

  10. A. Imiya, U. Eckhardt, The Euler characteristics of discrete objects and discrete quasi-objects. Comput. Vis. Image Underst. 75, 307–318 (1999)

    Article  Google Scholar 

  11. S. Inagaki, H. Itoh, A. Imiya, Variational multiple warping for cardiac image analysis, in Computer Analysis of Images and Patterns. Lecture Notes in Computer Science, vol. 10425 (Springer, Cham, 2015), pp. 749–759

    Chapter  Google Scholar 

  12. S. Inagaki, H. Itoh, A. Imiya, Multiple alignment of spatiotemporal deformable objects for the average-organ computation, in ECCV2014 Workshops 2014. Lecture Notes in Computer Science, vol. 8928 (Springer, Cham, 2015), pp. 353–366

    Google Scholar 

  13. H. Itoh, T. Sakai, K. Kawamoto, A. Imiya, Topology-preserving dimension-reduction methods for image pattern recognition, in Image Analysis. Lecture Notes in Computer Science, vol. 7944 (Springer, Berlin, 2013), pp. 195–204

    Chapter  Google Scholar 

  14. H. Itoh, A. Imiya, T. Sakai, Pattern recognition in multilinear space and its applications: mathematics, computational algorithms and numerical validations. Mach. Vis. Appl. 27, 1259–1273 (2016)

    Article  Google Scholar 

  15. M. Itskov, Tensor Algebra and Tensor Analysis for Engineers (Springer, Berlin, 2013)

    Book  Google Scholar 

  16. T.G. Kolda, B.W. Bader, Tensor decompositions and applications. SIAM Rev. 51, 455–500 (2008)

    Article  MathSciNet  Google Scholar 

  17. P.M. Kroonenberg, Applied Multiway Data Analysis (Wiley, Hoboken, 2008)

    Book  Google Scholar 

  18. L.D. Lathauwer, B. De Moor, J. Vandewalle, A multilinear singular value decomposition. SIAM J. Matrix Anal. Appl. 21, 1253–1278 (2000)

    Article  MathSciNet  Google Scholar 

  19. K. Maeda, From the subspace methods to the mutual subspace method, in Computer Vision, ed. by R. Cipolla, S. Battiato, G.M. Farinella. Studies in Computational Intelligence, vol. 285 (Springer, Berlin, 2010), pp. 135–156

    Google Scholar 

  20. A. Malcev, Foundations of Linear Algebra, Russian edition (1948) (English translation W.H. Freeman and Company, 1963)

    Google Scholar 

  21. J.M. Marron, A.M. Alonso, Overview of object oriented data analysis. Biom. J. 56, 732–753 (2014)

    Article  MathSciNet  Google Scholar 

  22. M. Mørup, Applications of tensor (multiway array) factorizations and decompositions in data mining. Wiley Interdiscip. Rev. Data Min. Knowl. Disc. 1, 24–40 (2011)

    Article  Google Scholar 

  23. T.M.W. Nye, Principal component analysis in the space of phylogenetic trees. Ann. Stat. 39, 2716–2739 (2011)

    Article  MathSciNet  Google Scholar 

  24. E. Oja, Subspace Methods of Pattern Recognition (Research Studies Press, Letchworth, 1983)

    Google Scholar 

  25. N. Otsu: Mathematical studies on feature extraction in pattern recognition, Researches of the Electrotechnical Laboratory, 818 (1981; in Japanese)

    Google Scholar 

  26. T. Sakai, M. Narita, T. Komazaki, H. Nishiguchi, A. Imiya, Image hierarchy in Gaussian scale space, in Advances in Imaging and Electron Physics, vol. 165 (Academic, Cambridge, 2013), pp. 175–263

    Google Scholar 

  27. G. Strang, T. Nguyen, Wavelets and Filter Banks (Wellesley-Cambridge Press, Wellesley, 1996)

    MATH  Google Scholar 

  28. D.W. Thompson, On Growth and Form (The Complete Revised Edition) (Dover, Mineola, 1992)

    Google Scholar 

  29. S. Watanabe, N. Pakvasa, Subspace method of pattern recognition, in Proceedings of the 1st International Joint Conference of Pattern Recognition (1973), pp. 25–32

    Google Scholar 

  30. G.W. Weber, F.L. Bookstein, Virtual Anthropology: A Guide to a New Interdisciplinary Field (Springer, Berlin, 2011)

    Book  Google Scholar 

  31. L. Younes, Shapes and Diffeomorphisms (Springer, Berlin, 2010)

    Book  Google Scholar 

  32. M.L. Zelditch, D.L. Swiderski, H.D. Sheets, Geometric Morphometrics for Biologists: A Primer, 2nd edn. (Academic, Cambridge, 2012)

    MATH  Google Scholar 

Download references

Acknowledgements

This research was supported by the “Multidisciplinary Computational Anatomy and Its Application to Highly Intelligent Diagnosis and Therapy” project funded by a Grant-in-Aid for Scientific Research on Innovative Areas from MEXT, Japan, and by Grants-in-Aid for Scientific Research funded by the Japan Society for the Promotion of Science.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hayato Itoh .

Editor information

Editors and Affiliations

Appendix

Appendix

The linear reduction operation R and its dual E are defined as

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} g(\boldsymbol{x})= Rf(\boldsymbol{y}) &\displaystyle =&\displaystyle \int_{{\mathbf{R}}^3}w_3(\boldsymbol{u})f(2\boldsymbol{x}-\boldsymbol{u})d\boldsymbol{u}, \end{array} \end{aligned} $$
(8.60)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} Eg(\boldsymbol{x})&\displaystyle =&\displaystyle 2^3 \int_{\mathbb{R}^3} w_3(\boldsymbol{u})g \left(\frac{\boldsymbol{x}-\boldsymbol{u}}{2}\right)d\boldsymbol{u}. \end{array} \end{aligned} $$
(8.61)

where w 3(x) = w(x)w(y)w(z) for x = (x, y, z) and

$$\displaystyle \begin{aligned} \begin{array}{rcl} w(s)=\left\{\begin{array}{ll} \frac{1}{2}(1-\frac{|s|}{2}), & |s|\leq 2\\ 0, & |s|> 2 \end{array}\right. \end{array} \end{aligned} $$
(8.62)

These processes are achieved by computing a weighted average of the image values in a finite small region, which is called the window for the operation.

Setting \(w_{\pm 1}=\frac {1}{4}\) and \(w_0=\frac {1}{2}\), for the two-dimensional sampled function f ijk = f(Δi, Δj, Δk), the transforms of Eqs. (8.60) and (8.61) are respectively described as

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} R f_{kmn}&\displaystyle =&\displaystyle \sum_{p,q,r=-1}^{1}w_{p}w_{q}w_{r} f_{2k-p\, \, 2m-q\,\,2n-p}, \end{array} \end{aligned} $$
(8.63)
$$\displaystyle \begin{aligned} \begin{array}{rcl} E f_{kmn}&\displaystyle =&\displaystyle \frac{1}{2^3} \sum_{p,q,r=-2}^{2} w _{p}w_{q}w_{r} f_{\frac{k-p}{2}\, \, \frac{m-q}{2}\,\,\frac{n-r}{2}}, \end{array} \end{aligned} $$
(8.64)

where the summation is achieved for \(\frac {(k-p)}{2}\), \(\frac {(m-q)}{2}\) and \(\frac {(n-r)}{2}\) being integers. These procedures are called the pyramid transform and extension, respectively. These two operations involve the reduction and expansion of the image sizes. Therefore, image features are extracted in the higher-layer images of the pyramid transform.

Setting

$$\displaystyle \begin{aligned} \boldsymbol{R}=\frac{1}{4}(\boldsymbol{I}\otimes(0,1)^\top)(\boldsymbol{D}+4\boldsymbol{I})\end{aligned} $$
(8.65)

for the second-order differential matrix D with the Neumann condition such that

$$\displaystyle \begin{aligned} \boldsymbol{D}=\left(\begin{array}{ccccccc} -1& 1&0 &0& \cdots & 0& 0\\ 1&-2&1 &0& \cdots & 0& 0\\ 0& 1&-2&1& \cdots & 0& 0\\ \vdots&\vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ 0& 0& 0& \cdots & 0& 1& -1 \end{array}\right),\end{aligned} $$
(8.66)

the three-dimensional pyramid transform

$$\displaystyle \begin{aligned} g_{pqr}=\sum_{i,j=-1}^{1}w_{i}w_{j}w_{k} f_{2p-i\,\,2q-j\,\,2r-k, },\end{aligned} $$
(8.67)

is redescribed

$$\displaystyle \begin{aligned} \mathcal{Y}=\mathcal{X} \times_1 \boldsymbol{R}\times_2 \boldsymbol{R}\times_3 \boldsymbol{R} \end{aligned} $$
(8.68)

using the Tucker-3 decomposition of \(\mathcal {X}\).

Since the eigenmatrix of D is the DCT-II matrix the three-dimensional pyramid transform processes the following property.

Property 8.2

Setting \(L^N={\mathsf {L}}( \{\boldsymbol {\varphi }_{k} \}_{k=0}^{2^N-1} )\), for three-dimensional images, the pyramid transform is a linear transform from L N × L N × L N to \(L^{\frac {N}{2}}\times L^{\frac {N}{2}} \times L^{\frac {N}{2}} \).

Equation (8.68) directly derives outlines of volumetric shapes by enforcing and inhibiting low- and high-frequency parts, respectively, on the DCT of the volumetric shape. Therefore, the dominant operation in the pyramid transform is the relaxed Karhunen-Loève transform using the DCT.

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Itoh, H., Imiya, A., Sakai, T. (2018). Relaxed Optimisation for Tensor Principal Component Analysis and Applications to Recognition, Compression and Retrieval of Volumetric Shapes. In: Tai, XC., Bae, E., Lysaker, M. (eds) Imaging, Vision and Learning Based on Optimization and PDEs. IVLOPDE 2016. Mathematics and Visualization. Springer, Cham. https://doi.org/10.1007/978-3-319-91274-5_8

Download citation

Publish with us

Policies and ethics