Introduction

In recent years, the development of high-speed, high-capacity communication technology using optical fiber and 5th generation mobile communication systems (5G), together with new surgical robots, has made remote surgery a reality [1]. One of the advantages of remote robotics surgery (RRS) is that it can reduce the physical, mental, and financial burden on patients and surgeons by reducing the need for travel. However, there are still many problems to overcome to implement RRS in society, one of which is the establishment of a stable communication environment. Communication delays or significant packet loss during surgery leads to the distribution of images and inadequate robot functioning, which are major obstacles to safe surgery [2,3,4,5]. To avoid this, it is essential to identify the communication bandwidth (CB) required for safe and stable telecommunication according to the amount of video data and operation data for each surgical robot. The purpose of this study was to identify the required CB for RRS using hinotori™, a novel surgical robot made in Japan. Setting the required CB is essential to ensure future implementation.

Materials and methods

The minimum required CB for RRS using hinotori™ was verified by gradually decreasing the CB from 500 to 100Mbps. We measured the communication round-trip time (RTT; the time in milliseconds from the time the switch on the surgeon cockpit side sends a request to the time the response is received from the switch on the operation unit side), jitter (variation in latency of packet flow), and packet loss (the fraction of the total transmitted packets that did not arrive at the receiver) for each CB. Ten skilled surgeons, including five gastroenterological surgeons, two urologists, two gynecologists, and one thoracic surgeon, participated in this experiment. All had sufficient experience in laparoscopic surgery and robotic surgery.

After we found the minimum required bandwidth, the participants were tested on a standard task (intracorporeal suturing) based on the Fundamentals of Laparoscopic Surgery (FLS) curriculum [6] using different levels of video compression (VC); that is, the process of reducing the total number of bits needed to represent a given image or video sequence (VC1: 120 Mbps, VC2: 40 Mbps, VC3: 20 Mbps) with the minimum required bandwidth. We measured the RTT, jitter, packet loss for the VC, task completion time, and robotic surgical skill using the Global Evaluative Assessment of Robotic Skills (GEARS). Subjective evaluation of the surgeon was validated using the System and Piper Fatigue Scale-12 (PFS-12).

Network connections

The operating rooms of Hokkaido University Hospital and Kyushu University Hospital were connected by SINET5 (Science Information NETwork) [7] (Fig. 1). SINET5 is a non-commercial science information network designed and operated by the National Institute of Informatics, and provides a nation-wide 100-Gbps backbone for about 1000 universities and research institutes throughout Japan. For this investigation, a virtual private communication circuit was established between the two hospitals alongside the Japan Sea, with a circuit distance of about 2000 km on a map basis and about 2600 km on an optical-fiber-length basis. The CBs of the circuit were set up in the range of 500–100 Mbps by specifying the rate limits so as to drop information packets if the usage rate of the circuit exceeded the specified rate limit. Communication information was compressed and decompressed using an encoder and decoder evaluated by Medicaroid (Medicaroid Corporation, Kobe, Japan). The encoders and decoders in this study use H.265 [8], which is a high compression technology that enables ultra-short delay video transmission and has been applied to ultra-short delay live broadcasting. A raw video is a sequence of images, and its size makes it impractical to store or transfer. VC takes advantage of the fact that the frames in a video sequence are highly correlated in time and reduces spatial and temporal redundancy, so that as few bits as possible are used to represent the video sequence. Modern standard video compression algorithms such as H.265 are psycho-visually optimized and compress the video data in such a way that quality and detail reduction are realistically invisible to human perception. To evaluate the communication delay during RRS, we measured the RTT of the network line and the packet loss of image signals. RTT is composed of communication line delay (SINET) (Fig. 2).

Fig. 1
figure 1

Schema of network connection for remote robotic surgery. The operating rooms at Hokkaido University Hospital and Kyushu University Hospital (network communication distance of nearly 2,000 km) were connected by SINET. SINET Science Information NETwork, HUH Hokkaido University Hospital, KUH Kyushu University Hospital

Fig. 2
figure 2

Network system. The round-trip time (RTT) is composed of communication line delay (SINET). SINET Science Information NETwork

Robot system

We used the hinotori™ surgical robot system (Medicaroid Corporation, Kobe, Japan), the first made-in-Japan robotic system, which received regulatory approval from the Japanese Ministry of Health, Labor, and Welfare in August, 2020. A Karl Storz™ 3D endoscope system (Karl Storz, Tuttlingen, Germany) was installed in the system.

Task: intracorporeal suturing

At least three throws of the suture were made, including one double throw and two single throws. The time was measured from when the instrument appeared on the monitor until when the suture material and needle were cut. The task completion time and results of the technical evaluation using GEARS were recorded [9]. The technical evaluation was conducted by two physicians certified in the endoscopic surgical skill qualification system of the Japan Society for Endoscopic Surgery (JSES) [10]. The subjective evaluation of the surgeon was validated using PFS-12 [11].

Statistical analysis

Each test score was compared between groups using the Mann–Whitney U test for continuous variables. Statistical significance was set at P < 0.05. Statistical analysis was performed using the JMP® 15 software (SAS Institute Inc., Cary, NC, USA).

Results

Minimum required CB for RRS using the hinotori™

Figure 3 shows an example of the network communication delay, packet loss, and jitter for each CB (500–300–200–150–145 Mbps). Ten surgeons attempted a simple task such as ring movement in each CB. At 145 Mbps CB, the packet loss was noticeable (3.0–7.0%) and image degradation was observed (Fig. 4). However, the RTT and jitter did not change (RTT, 30–30.4 ms; jitter, 0–0.35 ms).

Fig. 3
figure 3

The packet loss and jitter for communication bandwidths of 500 Mbps, 300 Mbps, 200 Mbps, 150 Mbps, and 145 Mbps. At 145 Mbps, the packet loss was noticeable (3–7%), but the round-trip time (RTT) and jitter did not change (RTT, 30–30.4 ms; jitter, 0–0.35 ms)

Fig. 4
figure 4

Operation image from 145 Mbps communication bandwidth. At 145 Mbps, image degradation was observed

Completing the task: intracorporeal suturing

Five surgeons attempted simulation of intracorporeal suturing with a CB of 145 Mbps, which was revealed to be the minimum required CB for the robot system in the former experiment. Concerning changes in network communication delay in VC2 and VC3, the RTT and jitter did not change (RTT, 30–31.5 ms; jitter, 0–0.6 ms), and no packet loss was observed (Fig. 5). The total amount of communication data, including the robot control signal, under a CB of 145 Mbps was 130–155 Mbps for VC1, 50–65 Mbps for VC2, and 35–40 Mbps for VC3 (Fig. 6). The intracorporeal suturing completion time was significantly longer in VC1 than in VC2 or VC3, at 667.4 ± 56.4 s, 275.8 ± 73.9 s, and 236.4 ± 42.5 s, respectively (VC1 vs. VC2, P = 0.009; VC1 vs. VC3, P = 0.009; VC2 vs. VC3, P = 0.209; Fig. 7a). The GEARS score was significantly lower in VC1 than in VC2 or VC3, at 17.4 ± 1.7, 26.6 ± 3.4, and 27.2 ± 1.5, respectively; (VC1 vs. VC2, P = 0.008; VC1 vs. VC3, P = 0.009; VC2 vs. VC3, P = 0.829; Fig. 7b). The PFS-12 score was significantly higher in VC1 than in VC2 or VC3, at 98.8 ± 18.1, 34.4 ± 20.5, and 33.2 ± 28.1, respectively (VC1 vs. VC2, P = 0.009; VC1 vs. VC3, P = 0.009; VC2 vs. VC3, P = 0.917; Fig. 7c).

Fig. 5
figure 5

Comparison of round-trip time (RTT), packet loss, and jitter depending on the video compression (VC) amount. At VC 1 (120 Mbps), packet loss was between 3 and 7%. At VC 2 (40 Mbps) and VC 3 (20 Mbps), packet loss was not observed. RTT and jitter showed no changes (RTT, 30–31.5 ms, jitter, 0–1.0 ms) for all VC

Fig. 6
figure 6

Total amount of communication data under communication bandwidth of 145 Mbps. The total amount of communication data (including the robot control signal) was 130–155 Mbps at video compression (VC) 1 (120 Mbps), 50–65 Mbps at VC 2 (40 Mbps), and 35–40 Mbps at VC 3 (20 Mbps)

Fig. 7
figure 7

Intracorporeal suturing completion time and robotic skill evaluation using the Global Evaluative Assessment of Robotic Skills (GEARS). The subjective evaluation of the surgeon was validated using the System and Piper Fatigue Scale-12 (PFS-12). a Intracorporeal suturing completion time: VC1 significantly prolonged the task completion time versus that with VC2 and VC3 (P = 0.009 and P = 0.009, respectively). b GEARS scores with VC1 were significantly lower than those with VC2 and VC3 (P = 0.008 and P = 0.009, respectively). c PFS-12 scores with VC1 were significantly higher than those with VC2 and VC3 (P = 0.009 and P = 0.009, respectively). VC: video compression (VC1, 120 Mbps; VC2, 40 Mbps; VC3, 20 Mbps)

Discussion

For this study, we set up the Japanese-made surgical robot system, hinotori™, in an operating room 2000 km away from the operator, to investigate the feasibility of RRS and confirmed the robot’s behavior in an environment where surgery is possible. Through this SINET connection verification, we confirmed that there was no recognizable communication delay or image degradation at a CB of more than 150 Mbps. Our findings also suggested that image degradation could be avoided by considering the amount of VC, even when the available CB is insufficient.

Telemedicine has become an inevitable trend through the development of modern medical technology. Teleconsultation, telediagnosis, mobile wards, remote patient image sharing, remote emergency treatment, image sharing and emergency treatment for stroke, digital operating rooms, and distance education have made considerable progress [12,13,14,15,16,17]. The development of RRS has been especially remarkable. Using the ZEUS robotic system and the Transatlantic Optical Faber Network, Jack Marescaux [18, 19] performed the first remote cholecystectomy clinically. This procedure, also known as Lindbergh surgery, is considered a milestone in telesurgery, following which 22 telesurgeries were performed at a hospital in North Bay, approximately 400 km north of Hamilton, Canada [20]. Although both surgeries were successful, the transatlantic connection used an expensive dedicated line (10 Mbps CB), whereas the Canadian clinical case used an Internet Protocol-Virtual Private Network line, a special inter-hospital network developed by the government (15 Mbps CB). In USA, Florida Hospital has successfully performed robot-assisted remote surgery using the Internet. Surgeons in Texas, 1200 miles away from Florida, remotely controlled a da Vinci robot to operate on a simulated patient via the Internet [21]. In Japan, robotic telesurgical simulation for training was first reported by Hashizume et al. [22]. Consequently, the underdeveloped information and communication technology was a decisive factor that led to a long hiatus in telesurgery research [23]. The recent development of high-speed, high-capacity communication technology using optical fiber and 5G, with new surgical robots, is making remote surgery a reality [24]. The bandwidths of the optical fiber and 5G network were 1 Gbps, which is comparable to the bearing capacity of the Internet and 100 times wider than that of the satellite network [1]. It is expected that robotic surgery using the Internet will develop further with the evolution of technology.

Despite this promising outlook, there are problems with RRS that need to be solved, one of which is the establishment of a stable communication environment. Communication delays during RRS can be a major obstacle to safe surgical procedures [2,3,4,5]. It has been reported that operability decreases when the transmission delay time perceived by the surgeon exceeds 200 ms, that errors increase when the delay time exceeds 300 ms [25, 26], and that surgery becomes almost impossible when the delay time exceeds 700 ms [27]. Many reports suggest that the delay time should be less than 200 ms, and ideally less than 100 ms, for a successful robot operation to be completed normally [28, 29].

In this study, it was possible to operate with minimal delay (< 30 ms) at all CBs; however, image degradation was observed at 145 Mbps CB. When robot control signals and audio signals were included in addition to the image signals, the traffic from all the signals exceeded 145 Mbps, and image degradation was observed. Because of the degraded images, the task completion time increased and the surgeon became more fatigued. The reason why image degradation rather than image delay occurred when reducing the CB was thought to be the adoption of traffic policing, which cuts off some of the traffic that exceeds the rate limit on SINET lines. At 145 Mbps, by changing the VC amount (VC 2: 40 Mbps, VC 3: 20 Mbps), image degradation disappeared at the same CB and we could not discern any decrease in image quality. In addition to image degradation, information and communication processing technology to compress and decompress transmission data are also important. The largest volume of transmission signals in the RRS is the video signal, which is strongly affected by the CB. Therefore, information and compression processing technologies are essential; however, the compression and decompression processes also cause delays. Because there is a trade-off between the compression ratio and time required for compression and decompression, it is necessary to develop encoders and decoders that achieve high compression and low delay. In this study, excessive image capacity load might cause image degradation, and the amount of VC needed to be adjusted as a countermeasure.

In this study, despite the long communication distance of approximately a 4000 km round-trip, we were able to communicate 3D 2 K images without image degradation with an RTT of 30 ms, which hardly affected the surgeon’s performance. Furthermore, even with a CB of 145 Mbps, we were able to perform the task without any image degradation or delay, using image compression technology. In the future, the limitations of CB may be overcome by the development of encoders, decoders, and 5G communication technologies that enable low-latency transmission of high-precision images, such as 8 K and 16 K.

In this study, we demonstrated that hinotori™ can be used in commercial communications by selecting a bandwidth type of service of more than 150 Mbps. Currently, there are two types of commercial communication networks, open and closed networks, which differ in their degree of security assurance, communication quality, and cost. In RRS, it is important to select a communication network based on the premise of sufficient communication quality and security, while considering economic efficiency. For future clinical applications of RRS, it would be desirable to develop guidelines for optimal communication systems focusing on safety, ethics, and costs.

In recent years, 5G communication technology has been reported to have advantages such as high speed and large capacity communication, high mobility, multiple connections, and wide bandwidth, which will be beneficial for robots that require a wide bandwidth for high-quality transmission, such as 4 K/8 K video [1]. The advantages of a 5G network over a 4G network also include wider bandwidth and lower latency time and, unlike the wired Internet, the 5G wireless network has high mobility and eliminates the regional restriction of special network cables. Therefore, it is expected that RSS will be realized in isolated islands and disaster areas, where it is difficult to lay wired Internet cables. Moreover, surgeries performed during the coronavirus disease pandemic era need to avoid infection crises from the flow of people. In this situation, RSS using 5G will support remote surgeries in regional hospitals throughout Japan and help train young surgeons.

This study has several limitations. First, because of the limited duration of the experiment, the number of tasks was small and the time to practice the robot operation was short. Second, the image quality was evaluated based on the surgeon’s impression, and no objective data analysis was conducted. In the future, remote surgery using high-precision images will be possible, and a universal image evaluation method will be necessary.

Conclusion

RRS using the novel hinotori™ surgical robot system can be performed safely if the CB is ≥ 150 Mbps. RRS can be implemented in society using currently available commercial communication networks.