Skip to main content
Log in

Transmission of images by unmanned underwater vehicles

  • Published:
Autonomous Robots Aims and scope Submit manuscript

Abstract

As an acoustic communications medium, water is characterized by frequency dependent attenuation, short range, very low bandwidth, scattering, and multi-path. It is generally difficult to acoustically communicate even terse messages underwater much less images. For the naval mine counter-measures mission, there is value in transmitting images of mine-like objects, acquired by side-scan sonar on-board unmanned underwater vehicles, to the above-water operator for review. The contribution of this paper is a methodology and implementation, based on vector quantization, to compress and transmit snippets of side-scan sonar images from underway unmanned underwater vehicles to an operator. The work has been validated through controlled indoor tank tests and several at-sea trials. The fidelity of the received images is such that trained operators can recognize targets in the received images as well as they would have in the original images. Future work investigates machine learning to improve the compression basis and psycho-visual studies for the specialized skill of feature recognition in sonar images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  • Atallah, A. M., Ali, H. S. & Abdallah, M. I. (2016). An integrated system for underwater wireless image transmission. In Proceedings of the 28th international conference on microelectronics (pp. 169–172).

  • Bhattacharyya, P., Mitra, A., & Chatterjee, A. (2014). Vector quantization based image compression using generalized improved fuzzy clustering. In Proceedings of the 2014 international conference on control, instrumentation, energy and communication (pp. 662–666).

  • Bin, L., & Qinggang, M. (2013). An improved SPIHT wavelet transform in the underwater acoustic image compression. In Proceedings of 2013 2nd international conference on measurement, information and control (ICMID) (Vol. 2, pp. 1315–1318).

  • Chen, W., Yuan, F., Cheng, E., & Lin, W. (2017). Subjective and objective quality evaluation of sonar images for underwater acoustic transmission. In Proceedings of the 2017 IEEE International Conference on Image Processing (pp. 176–180).

  • Committee for Mine Warfare Assessment, Naval Studies Board, National Research Council, Naval Mine Warfare: Operational and Technical Challenges for Naval Forces, http://acomms.whoi.edu/wp-content/uploads/sites/20/2014/09/401040-SIG-Micromodem-Software-Interface-Guide.pdf. (2001).

  • DeBortoli, R., Nicolai, A., Li, F., & Hollinger, G. (2017). Assessing perception quality in sonar images using global context. In Proceedings of Intelligent Robots and Systems Conference (pp. 1–8).

  • Eigen Library for Matrix Algebra in C, https://www.quantstart.com/articles/Eigen-Library-for-Matrix-Algebrain-C. Accessed December 2016.

  • Ferdowski, S., Voloshynovskiy, S., & Kostadinov, D. (2017). A multi-layer image representation using regularized residual quantization: Application to compressing and de-noising. In Proceedings of the 2017 international conference on Image Processing (pp. 2697–2701).

  • Hinton, G., Vinyals, O., & Dean, J. (2014). Distilling the knowledge in a neural network. In Proceedings of NIPS Deep Learning Working (pp. 1–9).

  • Huffman, D. A. (1952). A method for the construction of minimum-redundancy codes. In Proceedings of I.R.E. (pp. 1098–1101).

    Article  Google Scholar 

  • Kalwa, J., & Madsen, A. L. (2004). Sonar image quality assessment for an autonomous underwater vehicle. In Proceedings World Automation Congress ( Vol. 15, No. 33–38).

  • Li, J., Wang, W., Li, K., Meng, W., & Guo, X. (2018). Application of image quality assessment in marine biological density prediction. In Proceedings of Chinese Control and Decision Conference (pp. 6658–6661).

  • Liu, P., & Li, G. (2013). An improved SPIHT algorithm for image compression in low bit rate. Communications and Network, 5, 245–248.

    Article  Google Scholar 

  • Moszynski, M., Chybicki, A., Kulawiak, M., & Lubniewski, Z. (2013). A novel method for archiving multibeam sonar data with emphasis on efficient record size reduction and Storage. Polish Maritime Research, 20(1), 77–86.

    Article  Google Scholar 

  • Murphy, C., Wang, R. Y., & Singh, H. (2010). Seafloor image compression with large tilesize vector quantization. In IEEE/OES autonomous underwater vehicles (pp. 1–8).

  • Nams, D., & Seto, M. L. (2015). On-line adaptation of underwater acoustic transmission rates to optimize communications for collaborative AUV missions. In Proceedings of 2015 MTS/IEEE Oceans Conference (pp. 1–8).

  • Purvis, M. (Ed.). Indigo—ROS wiki, edited 2014-08-22, http://wiki.ros.org/indigo. Accessed May 2017.

  • Said, A., & Pearlman, W. A. (1996). A new, fast, and efficient image codec based on set partitioning in hierarchical trees. IEEE Transactions on Circuits and Systems for Video Technology, 6(3), 243–250.

    Article  Google Scholar 

  • Shen, J. & Huang, H.C. (2010). An adaptive image compression method based on vector quantization. In IEEE 1st international conference on pervasive computing, signal processing and applications (pp. 377–381).

  • Skodras, A., Christopoulos, C., & Ebrahimi, T. (2001). The JPG 2000 still image compression standard. IEEE Signal Processing Magazine, 18(5), 36–58.

    Article  Google Scholar 

  • Wallace, G. K. (1992). The jpeg still picture compression standard. IEEE Transactions on Consumer Electronics, 38(1), xviii-xxxiv.

    Article  Google Scholar 

  • Wang, Z., Bovik, A. C., Sheikh, H. R., & Simoncelli, E. P. (2004). Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4), 1–14.

    Article  Google Scholar 

  • WHOI Acoustic Communications Group: Micro-Modem Software Interface Guide, Document ID: 401040, Version 3.25. (2014).

  • Yang, M., & Bourbakis, N. (2005). An overview of lossless digital image compression techniques. In IEEE (pp. 1099–1102).

  • Zhang, Y., Li, Q. Z., & Negahdaripour, S. (2015). Seafloor image compression using hybrid wavelets and directional filter banks. In Proceedings of the MTS/IEEE Oceans Conference (pp. 1–9).

  • Zhang, Y., Negahdaripour, S., & Li, Q. (2016). Low bit-rate compression of underwater imagery based on adaptive hybrid wavelets and directional filter banks. Signal Processing: Image Communication, 47, 96–114.

    Google Scholar 

Download references

Acknowledgements

This project is grateful for the timely and helpful advice and assistance of the WHOI Acoustic Communications Group.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mae L. Seto.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is one of the several papers published in Autonomous Robots comprising Special Issue on Robot Communication Challenges: Real-World Problems, Systems, and Methods.

Appendix

Appendix

1.1 Implementation as ROS nodes

ROS (robotic operating system) is a widely supported open source middleware for robotic systems (Purvis 2017). It provides operating system services, including hardware abstraction, low-level device control, implementation of commonly used functionality, message-passing between processes, and package management. It also provides tools and libraries to build, write, and run code across multiple platforms. It is light weight enough that it is widely applied in embedded systems. ROS works as a publish-subscribe architecture.

ROS is used by Defence R&D Canada (DRDC) as a middleware for its IVER3 UUVs’ on-board autonomy. Implementation of the compression and transmit algorithm as ROS nodes facilitates rapid integration into this autonomy.

Fig. 16
figure 16

Interconnection of nodes on the transmit (UUV) unit

There are nodes on the transmit (UUV) side and the receive (operator) side. On the transmit (UUV) side, the compression algorithm consists of 3 ROS nodes described next and illustrated as a ROS rqt_graph in Fig. 16.

  • talker Tasked with publishing the path of the images to compress on the dedicated topic imgToCompress. Initially, it simulated the ATD algorithm detecting MLO images from the sonar data stream. Later, it was directly integrated with the ATD

  • compressor Subscribes to the topic imgToCompress. Each time it receives a new image path it compresses the image and publishes the encoded image on the topic imgToDecompress.

  • uuvModem This node subscribes to the topic imgToDecompress. It is connected to the acoustic modem through a serial port. For each encoded image it receives, it sends it through the underwater acoustic link using the appropriate messages.

Fig. 17
figure 17

Interconnection of nodes on the receive (operator) unit

Fig. 18
figure 18

Example of a message in an acoustic packet as formed by ModemUUV node

On the receive (operator) side (Fig. 17) are two nodes:

  • opModem This node is connected to the receiving acoustic modem through a serial port. It ‘listens’ to all messages sent by the UUV’s modem to find the packet that contains the encoded image. When that message is found, it extracts the content of the message and publishes it on the topic imgToDecompress.

  • decompressor This node subscribes to the topic imgToDecompress and when a new encoded image is received, it decodes it and saves the reconstructed image to the / out folder.

1.2 Underwater acoustic communications

The acoustic modems can transmit/receive at several discrete rates that vary with packet size. For the initial in-laboratory tests, rate 4 (Table 1) was chosen so an encoded image could fit into the minimum number of packets of one frame each.This makes it easier to perform the nodes’ algorithmic proof-of-principle tests at the higher risk of losing packets. Losing the occasional packet was not a primary concern for these tests. Packet loss will be addressed in the at-sea tests. The proof-of -principle tests were performed in an indoor water tank where the transmit and receive acoustic modems were 2–3 m apart which limits packet loss. The final at-sea implementation will be performed at rate 1 which is more reliable albeit at a cost of smaller packets which means an image would be transmitted over multiple frames. This will be explained in a later section. The maximum size of a frame at rate 4, is 256 bytes so an encoded image could be transmitted over two packets. This means half of one encoded image, at a time, would be published on the topic imgToDecompress. As shown in Fig. 18, the message published contains a header with the first character identifying whether the packet contains the top or bottom half of an image followed by the image identifier coded on four characters to recognize the origin of the half image. The rest of the message is half of the encoded image.

The uuvModem node is connected to the acoustic modem through a serial port, which was configured in write-only mode. At initialization, it gives the acoustic modem an identifier to send messages in the later stages. The syntax to set the identifier to ‘2’ is:

$$\begin{aligned} {\$}{\mathrm{CCCFG, SRC}}, 2\backslash {\mathrm{r}}\backslash {\mathrm{n}} . \end{aligned}$$
(1)

Then, when a message is published on the topic it writes the following instruction to the serial port:

$$\begin{aligned} {\$}{\mathrm{CCCYC,0,2,1,4,0,1}}\backslash {\mathrm{r}}\backslash {\mathrm{n}} . \end{aligned}$$
(2)

This informs the acoustic modem that a packet will be sent from modem number 2 to modem number 1 with acoustic modem rate 4 containing 1 frame. The modem then replies with a query for the packet to send with:

(3)

The user then provides the hex-encoded message to the modem:

(4)

On the operator side (Fig. 17), the ModemOp node configures the connection with the modem through the serial port to read and write mode. At initialization of the node, it will set the identifier for the operator’s modem to ‘1’ using a command similar to (1) above. Then, the node listens to the modem output through the serial port. If it reads a CARXD message it means a packet was received. The content of that packet is read and published on the topic imgToDecompress.

The decompressor node is subscribing (listening) to the topic imgToDecompress. When a message is published on that topic, this node will identify (1) whether it is for the top or bottom half of an image and (2) its unique identifier from the message header. If it previously received the other half of the image, it will concatenate the two and decompress the image using functions from the VQScheme class. But, if the other half came from a different image, then the older packet will be decompressed after zero padding its missing half. The last packet received will be stored to wait for its other half for decompression. If no other half was received, the node will wait for the next packet for decompression.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Danckaers, A., Seto, M.L. Transmission of images by unmanned underwater vehicles. Auton Robot 44, 3–24 (2020). https://doi.org/10.1007/s10514-019-09866-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10514-019-09866-z

Keywords

Navigation