Abstract
Within Électricité de France (EDF) hydroelectric power stations, IGCBoxes are industrial mini PCs dedicated to industrial process data archiving. These equipments expose distinctive features, mainly on their storage system based exclusively on flash memory due to environmental constraints. This type of memory had notable consequences on data acquisition performance, with a substantial drop compared with hard disk drives. In this setting, we have designed Chronos, an open-source NoSQL system for sensor data management on flash memories. Chronos includes an efficient quasi-sequential write pattern along with an index management technique adapted for process data management. As a result, Chronos supports a higher velocity for inserted data, with acquisition rates improved by a factor of 20–54 over different solutions, therefore solving a practical bottleneck for EDF.
Similar content being viewed by others
Notes
SSD Mtron MSD SATA3035-032.
Flash chip HYNIX HY27UG088G5B with an ALCOR AU6983HL controller.
Kingston DataTraveler R500 64 Go.
Kingston SD card SD/2GB B000EOMXM0.
See Footnote 2.
See Footnotes 3 and 4.
For instance, for our benchmark (cf. Sect. 4.2), with 17 bytes tuples (among which 12 are the key), and 4096 bytes sectors, \(L = 240\) and \(F = 256\), then 99.6 % of the tree is made of leaves.
Kingston DataTraveler R500 64 Go.
MySQL is CPU bound for insertions on hard disk drives, all other queries and systems combinations are I/O bound.
AMD Geode 500 MHz, 2 GB CompactFlash storage device.
References
Pasteur, O., Léger, S.: Results of the use of MySQL free DBMS as a data historian. EDF internal technical repport, H-P1D-2007-02670-FR (2007)
Bouganim, L., Jónsson, B.T., Bonnet, P.: uFLIP: Understanding flash IO patterns. In: CIDR’09: 4th Biennial Conference on Innovative Data Systems Research, Asilomar, USA (2009)
Wang, Y., Goda, K., Kitsuregawa, M.: Evaluating non-in-place update techniques for flash-based transaction processing systems. In: DEXA’09: 20th International Conference on Database and Expert Systems Applications, Linz, Austria, pp. 777–791 (2009)
Birrell, A., Isard, M., Thacker, C., Wobber, T.: A design for high-performance flash disks. Oper. Syst. Rev. 41(2), 88–93 (2007)
Chardin, B., Pasteur, O., Petit, J.-M.: An FTL-agnostic layer to improve random write on flash memory. In: FlashDB’11: 1st International Workshop on Flash-Based Database Systems, Hong Kong, China, pp. 214–225 (2011)
Arpaci-Dusseau, A.C., Arpaci-Dusseau, R.H., Prabhakaran, V.: Removing the costs of indirection in flash-based SSDs with nameless writes. In: HotStorage’10: 2nd Workshop on Hot Topics in Storage and File Systems, Boston, USA, pp. 1–5 (2010)
Graefe, G.: Modern B-tree techniques. Found. Trends Databases 3(4), 203–402 (2011)
Olson, M.A., Bostic, K., Seltzer, M.I., Berkeley, D.B.: In: FREENIX’99: 1999 USENIX Annual Technical Conference. FREENIX Track, Monterey, USA, pp. 183–191 (1999)
Transaction Processing Performance Council: TPC Benchmark C Standard Specification (2007)
Transaction Processing Performance Council: TPC Benchmark H Standard Specification (2008)
Arasu, A., Cherniack, M., Galvez, E., Maier, D., Maskey, A.S., Ryvkina, E., Stonebraker, M., Tibbetts, R.: Linear road: a stream data management benchmark. In: VLDB’04: 30th International Conference on Very Large Data Bases, Toronto, Canada, pp. 480–491 (2004)
Pasteur, O.: Overview of the long-term stored data in power generation units. EDF internal technical repport, H-P1D-2007-01076-FR (2007)
Chardin, B., Lacombe, J.-M., Petit, J.-M.: Data historians in the data management landscape. In: TPCTC’12: 4th TPC Technology Conference on Performance Evaluation & Benchmarking, Istanbul, Turkey (2012)
Shafer, I., Sambasivan, R.R., Rowe, A., Ganger, G.R.: Specialized storage for big numeric time series. In: HotStorage’13: 5th Workshop on Hot Topics in Storage and File Systems, pp. 1–5. (2013)
Sumbaly, R., Kreps, J., Gao, L., Feinberg, A., Soman, C., Shah, S.: Serving large-scale batch computed data with project voldemort. In: Proceedings of the 10th USENIX Conference on File and Storage Technologies, FAST’12, pp. 18–18 (2012)
Woodhouse, D.: Jffs : The Journalling Flash File System. http://sources.redhat.com/jffs2/jffs2 (2001)
Konishi, R., Amagai, Y., Sato, K., Hifumi, H., Kihara, S., Moriai, S.: The linux implementation of a log-structured file system. SIGOPS Oper. Syst. Rev. 40(3), 102–107 (2006)
Stoica, R., Athanassoulis, M., Johnson, R., Ailamaki, A.: Evaluating and repairing write performance on flash devices. In: DaMoN’09: 5th International Workshop on Data Management on New Hardware, Providence, USA, pp. 9–14 (2009)
Lee, S.-W., Moon, B.: Design of flash-based DBMS: an in-page logging approach. In: SIGMOD’07: 33rd International Conference on Management of Data, Beijing, China, pp. 55–66 (2007)
Kim, Y.-R., Whang, K.-Y., Song, I.-Y.: Page-differential logging: an efficient and DBMS-independent approach for storing data into flash memory. In: SIGMOD’10: 36th International Conference on Management of Data, Indianapolis, USA, pp. 363–374 (2010)
Li, Y., Hey, B., Luo, Q., Yi, K.: Tree indexing on flash disks. In: ICDE’09: 25th International Conference on Data Engineering, Shanghai, China, pp. 1303–1306 (2009)
Agrawal, D., Ganesan, D., Sitaraman, R., Diao, Y., Singh, S.: Lazy-adaptive tree: an optimized index structure for flash devices. Proc. VLDB Endow. 2(1), 361–372 (2009)
Dobreva, V., Albutiu, M.-C., Brunel, R., Neumann, T., Kemper, A.: Get tracked: a triple store for rfid traceability data. In: Advances in Databases and Information Systems, vol. 7503 of Lecture Notes in Computer Science, Springer, Berlin, pp. 167–180 (2012)
Agrawal, N., Prabhakaran, V., Wobber, T., Davis, J.D., Manasse, M., Panigrahy, R.: Design tradeoffs for SSD performance. In: USENIX’08: 2008 USENIX Annual Technical Conference, Boston, USA, pp. 57–70 (2008)
Lee, S.-W., Moon, B., Park, C.: Advances in flash memory SSD technology for enterprise database applications. In: SIGMOD’09: 35th International Conference on Management of Data, Providence, USA, pp. 863–870 (2009)
Jung, M., Kandemir, M.: Revisiting widely held SSD expectations and rethinking system-level implications. SIGMETRICS Perform. Eval. Rev. 41(1), 203–216 (2013)
Bjorling, M., Bonnet, P., Bouganim, L., Dayan, N.: The necessary death of the block device interface. In: CIDR’13: 6th Biennial Conference on Innovative Data Systems Research, Asilomar, USA (2013)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chardin, B., Lacombe, JM. & Petit, JM. Chronos: a NoSQL system on flash memory for industrial process data. Distrib Parallel Databases 34, 293–319 (2016). https://doi.org/10.1007/s10619-015-7175-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10619-015-7175-0