On the Optimality of Tape Merge of Two Lists with Similar Size

Li, Qian; Sun, Xiaoming; Zhang, Jialin

doi:10.1007/s00453-020-00690-x

On the Optimality of Tape Merge of Two Lists with Similar Size

Published: 19 February 2020

Volume 82, pages 2107–2132, (2020)
Cite this article

Algorithmica Aims and scope Submit manuscript

Qian Li¹^nAff4,
Xiaoming Sun^2,3 &
Jialin Zhang^2,3

223 Accesses
1 Citation
Explore all metrics

Abstract

The problem of merging sorted lists in the least number of pairwise comparisons has been solved completely only for a few special cases. Graham and Karp (Sorting Search 3:197–207, 1999) independently discovered that the tape merge algorithm is optimal in the worst case when the two lists have the same size. In their seminal papers, Stockmeyer and Yao (SIAM J Comput 9(1):85–90, 1980), Murphy and Paull (Inf Control 42(1):87–96, 1979), and Christen (On the optimality of the straight merging algorithm, 1978) independently showed when the lists to be merged are of size m and n satisfying $m\le n\le \lfloor \frac{3}{2}m\rfloor +1$, the tape merge algorithm is optimal in the worst case. This paper extends this result by showing that the tape merge algorithm is optimal in the worst case whenever the size of one list is no larger than 1.52 times the size of the other. The main tool we use to prove the lower bound is Knuth’s (1999) adversary methods. In addition, we show that the lower bound cannot be improved to 1.8 via Knuth’s adversary methods. Moreover, we design a simple procedure, and by invoking this procedure recursively until the remaining subproblem can be solved efficiently by another known algorithm, we achieve constant improvement of the upper bound for $2m-2\le n\le 3m $.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Longest Common Substring with Approximately k Mismatches

Article Open access 16 February 2019

A Practical Fixed-Parameter Algorithm for Constructing Tree-Child Networks from Multiple Binary Trees

Article 15 February 2022

The p-Median Problem

Notes

As we just mentioned in the Abstract, though the concrete algorithm is not specified in this paper, it can be easily obtained by invoking our procedure recursively.

References

Ajtai, M., Feldman, V., Hassidim, A., Nelson, J.: Sorting and selection with imprecise comparisons. In: Proceedings of the 36th International Colloquium on Automata, Languages and Programming: Part I, ICALP’09, pp. 37–48 (2009)
Brown, M.R., Tarjan, R.E.: A fast merging algorithm. J. ACM 26(2), 211–226 (1979)
MathSciNet MATH Google Scholar
Cardinal, J., Fiorini, S.: On Generalized Comparison-Based Sorting Problems, pp. 164–175. Springer, Berlin (2013)
MATH Google Scholar
Cheng, Y., Sun, X., Yin, Y.L.: Searching monotone multi-dimensional arrays. Discrete Math. 308(11), 2213–2221 (2008)
MathSciNet MATH Google Scholar
Christen, C.: Improving the bounds on optimal merging. In: Proceedings of 19th IEEE Conference on the Foundations of Computer Science, pp. 259–266 (1978)
Christen, C.: On the optimality of the straight merging algorithm (1978) (unpublished)
de la Vega, W.F., Frieze, M.A., Santha, M.: Average-case analysis of the merging algorithm of hwang and lin. Algorithmica 22(4), 483–489 (1998)
MathSciNet MATH Google Scholar
de la Vega, W.F., Kannan, S., Santha, M.: Two probabilistic results on merging. SIAM J. Comput. 22(2), 261–271 (1993)
MathSciNet MATH Google Scholar
Gavril, F.: Merging with parallel processors. Commun. ACM 18(10), 588–591 (1975)
MATH Google Scholar
Gupta, A., Kumar, A.: Sorting and selection with structured costs. In: Proceedings of the 42nd IEEE Symposium on Foundations of Computer Science, p. 416 (2001)
Huang, Z., Kannan, S., Khanna, S.: Algorithms for the generalized sorting problem. In: Proceedings of the 52nd IEEE Symposium on Foundations of Computer Science, pp. 738–747 (2011)
Hwang, F.K., Deutsch, D.N.: A class of merging algorithms. J. ACM 20(1), 148–159 (1973)
MathSciNet MATH Google Scholar
Hwang, F., Lin, S.: A simple algorithm for merging two disjoint linearly ordered sets. SIAM J. Comput. 1(1), 31–39 (1972)
MathSciNet MATH Google Scholar
Hwang, F.K.: Optimal merging of 3 elements with $n$ elements. SIAM J. Comput. 9(2), 298–320 (1980)
MathSciNet MATH Google Scholar
Hwang, F.K., Lin, S.: Optimal merging of 2 elements with $n$ elements. Acta Inf. 1(2), 145–158 (1971)
MathSciNet MATH Google Scholar
Kannan, S., Khanna, S.: Selection with monotone comparison costs. In: Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA’03, pp. 10–17 (2003)
Knuth, D.: The art of computer programming. Sorting Search. 3, 197–207 (1999)
Google Scholar
Li, Q., Sun, X., Zhang, J.: Tables of $\lambda $m$\rho $ and the program. http://theory.ict.ac.cn/liqian (2016)
Linial, N., Saks, M.: Searching ordered structures. J. Algorithms 6(1), 86–103 (1985)
MathSciNet MATH Google Scholar
Linial, N.: The information-theoretic bound is good for merging. SIAM J. Comput. 13(4), 795–801 (1984)
MathSciNet MATH Google Scholar
Manacher, G.K., Bui, T.D., Mai, T.: Optimum combinations of sorting and merging. J. ACM 36(2), 290–334 (1989)
MathSciNet Google Scholar
Manacher, G.K.: Significant improvements to the Hwang–Lin merging algorithm. J. ACM 26(3), 434–440 (1979)
MathSciNet MATH Google Scholar
Murphy, P.E.: A problem in optimal meging. Technical Report DCS-TR-69, Rutgers University, Department of Computer Sciences (1978)
Murphy, P.E., Paull, M.C.: Minimum comparison merging of sets of approximately equal size. Inf. Control 42(1), 87–96 (1979)
MathSciNet MATH Google Scholar
Mönting, J.S.: Merging of 4 or 5 elements with $n$ elements. Theor. Comput. Sci. 14(1), 19–37 (1981)
MathSciNet MATH Google Scholar
Smith, W.D., Lang, K.J.: Values of the merging function and algorithm design as a game (1994). https://core.ac.uk/display/24326238(unpublished)
Stockmeyer, P.K., Yao, F.F.: On the optimality of linear merge. SIAM J. Comput. 9(1), 85–90 (1980)
MathSciNet MATH Google Scholar
Tanner, R.M.: Minimean merging and sorting: an algorithm. SIAM J. Comput. 7(1), 18–38 (1978)
MathSciNet MATH Google Scholar
Thanh, M., Alagar, V.S., Bui, T.D.: Optimal expected-time algorithms for merging. J. Algorithms 7(3), 341–357 (1986)
MathSciNet MATH Google Scholar
Thanh, M., Bui, T.D.: An improvement of the binary merge algorithm. BIT Numer. Math. 22(4), 454–462 (1982)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work was supported by the 973 Program of China Grant No. 2016YFB1000201, the National Natural Science Foundation of China Grants Nos. 61832003, 61761136014, 61872334, 61433014, and K.C.Wong Education Foundation.

Author information

Qian Li
Present address: Guangdong Province Key Laboratory of Popular High Performance Computers, Shenzhen University, Shenzhen, China

Authors and Affiliations

Shenzhen Institute of Computing Sciences, Shenzhen University, Shenzhen, China
Qian Li
CAS Key Lab of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
Xiaoming Sun & Jialin Zhang
University of Chinese Academy of Sciences, Beijing, 100049, China
Xiaoming Sun & Jialin Zhang

Authors

Qian Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoming Sun
View author publications
You can also search for this author in PubMed Google Scholar
Jialin Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jialin Zhang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Proof of Lemma 4

Proof

We apply induction on m and n. The cases for $1\le m+n \le 10$ are given in [17]. Now suppose that $m+n\ge 11$ and the lemma is already established for any $(m',n')$ satisfying $m'+n'<m+n$.

Part (b)Case 1$m=1$.

The algorithm begins by comparing $a_1$ and $b_n+1$. Suppose the adversary claims $a_1>b_{n+1}$, then the algorithm is done with only one comparison! Otherwise, the adversary claims $a_1<b_{n+1}$, then it reduces to the case (1, n). Thus

$$\begin{aligned} /M.(1,n+1)\le /M^{1,n+1}.(1,n+1)=\max \{1, 1+/M.(1,n)\}=1+/M.(1,n). \end{aligned}$$

Case 2$m\ge 2$.

The algorith begins by comparing $a_m$ and $b_{n+1}$. Suppose the adversary claims $a_m>b_{n+1}$, then it reduces to the case $(m-1,n+1)$. Otherwise, the adversary claims $a_m<b_{n+1}$, then it reduces to the case (m, n). Thus

$$\begin{aligned} /M.(m, n+1)\le /M^{m,n+1}.(m,n+1)\le \max \{/M.(m-1, n+1), /M.(m, n)\}+1. \end{aligned}$$

By two applications of the induction hypothesis, $/M. (m, n)\ge /M. (m-1, n)+1\ge /M.(m-1, n+1)$, thus $/M.(m, n+1)\le /M.(m, n)+1$.

Part (c)Case 1$m=n$.

According to Part (a), $/M.(m+1,m)\ge /M.(m,m-1)+2\ge /M.(m-1,m-2)+4\ge \cdots \ge /M.(5,4)+2m-8=2m$, thus $/M.(m+1,m)\ge /M.(m,m)+1$, since $/M.(m,m)\le 2m-1$.

Case 2$m<n$, we have

$$\begin{aligned} /M. (m+1, n)\ge /M. (m, n-1)+2\ge /M. (m, n)+1. \end{aligned}$$

The first inequality is due to Part(a), and the second one is by the induction hypothesis. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Q., Sun, X. & Zhang, J. On the Optimality of Tape Merge of Two Lists with Similar Size. Algorithmica 82, 2107–2132 (2020). https://doi.org/10.1007/s00453-020-00690-x

Download citation

Received: 06 February 2020
Accepted: 10 February 2020
Published: 19 February 2020
Issue Date: July 2020
DOI: https://doi.org/10.1007/s00453-020-00690-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the Optimality of Tape Merge of Two Lists with Similar Size

Abstract

Access this article

Similar content being viewed by others

Longest Common Substring with Approximately k Mismatches

A Practical Fixed-Parameter Algorithm for Constructing Tree-Child Networks from Multiple Binary Trees

The p-Median Problem

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Proof of Lemma 4

Proof

Rights and permissions

About this article

Cite this article

Keywords

Navigation

On the Optimality of Tape Merge of Two Lists with Similar Size

Abstract

Access this article

Similar content being viewed by others

Longest Common Substring with Approximately k Mismatches

A Practical Fixed-Parameter Algorithm for Constructing Tree-Child Networks from Multiple Binary Trees

The p-Median Problem

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Proof of Lemma 4

Appendix: Proof of Lemma 4

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation