Keywords

1 Introduction

As a series of systems deep geochemical exploration, space remote sensing and geodetic survey, were established and improved. Vast amounts of data have been gradually accumulated in the area of earth science, and these data need to be made use of by scientists. Association rules is a data mining method. It was first put forward by Agrawal et al. [1] in 1993. Now it has become one of the most widely used algorithms in the field of data mining. Li et al. [2] has proposed a new method for conversion-cloud information instead of ā€œhardā€ division which divides a cloud environment into several quantitative concepts based on cloud model considering fully fuzziness and randomness of data. Subsequently, the cloud conversion has been used in many fields. Hu et al. [3] proposes a new way of figuring out the weight of land evaluation factors by mapping qualitative linguistic words into a fine-changeable cloud drops and translating the uncertain factor conditions into quantitative values with the uncertain illation based on cloud model. Fang and Yu [4] attempts to evaluate the economics of wind power projects based on the cloud model. Han et al. [5] imports the conceptual partition algorithm based on cloud model to finish the exchange from qualification to quantification. Expectation of some concept is not just a numeric value but in a sequence, so trapezoidal cloud makes description more accord with contiguous data. Wang [6] uses trapezoidal cloud model to advance the concepts of division, and transforms qualitative data in to quantitative conception which is proved to be effective. Complexity of geosciences data behaves as not only fuzziness and uncertainty but also large amount in quantities and global continuity.

In this paper, a traditional trapezoidal cloud transformation is described in order to avoid a lack of information about data mutation and to carry out a reasonable and sensitive exchange from qualification to quantification.

2 Construction of Association Rules Mining Model Based on Ameliorated Trapezoidal Cloud Transformation

2.1 Concept Partition Algorithm Based on Ameliorated Trapezoidal Cloud Transformation

According to the basic idea of data fitting and a certain rule [5], spatial data of any irregular distribution are mathematically transformed so as to generate a set of atomic concepts and make the distributed spatial data become the superposition of several concept of different size, the basic idea is expressed in Eq.Ā (1).

$$ g(\text{x}) = \sum\nolimits_{i = 1}^{n} {(\text{c}_{i} * {\text{f}}_{i} (\text{x}))} + \varepsilon \,and\,0 < Max(\left| {g(\text{x}) - \sum\nolimits_{i = 1}^{n} {({\text{c}}_{i} *\text{f}_{i} (\text{x}))} } \right|) < \varepsilon $$
(1)

Geosciences data bears global continuity. As the concept is described, its expectation is not just a numeric value but in a sequence, so trapezoidal cloud makes description more accord with the features of geoscience data. In this paper, trapezoidal cloud is adopted to assist concept division, i.e., dividing a concept into atomic concepts inĀ a numberĀ field. FigureĀ 1Ā showsĀ the structure of trapezoidal cloud.

Fig.Ā 1.
figure 1

Diagram of structure of trapezoidal cloud

The function \( {\text{Find}}\_{\text{E}}{\text{n}} \) is used to search for entropy and hyper entropy of cloud droplets. The idea of backward cloud generator [2] is applied during the search.

2.2 Association Rules Mining Algorithm Based on Multi-Level Association Rules Algorithm (MLAR)

Association rules is an important branch in the research area of data mining. It is aimed at mining correlations hidden behind massive data. As a large amount of data is collected and stored, it is increasingly demanded by scientists to discover knowledge from the data.

3 Prediction Model of Fault Property in Chengdu Office Area

In this paper, we will take Chengdu Office area (28Ā°~32Ā° E, 102Ā°~108Ā° N) as research zone to determine and predict nature, position and scale of simulated faults by the model of multilevel association rules algorithm based on improved trapezium-cloud model according the geophysical information 1:1,000,000 geologic map of this area conveys.

3.1 Data Preparation and Attribute Extraction Simulation

In this paper, we take fault in Chengdu Office area (28Ā°~32Ā° E, 102Ā°~108Ā° N) as the research subject in processing known fault data. FigureĀ 2 presents the spatial data used by this work.

Fig.Ā 2.
figure 2

SatelliteĀ image of known faults in Chengdu office zone

Current findings in the field of earth science are built on the fusion of previous space data. According to prior knowledge of experts, subjective judgments on new knowledge are made by using visual interpretation, field measurements and other methods. Rare automated discoveries on new knowledge of earth science are achieved by the degree of inference through association rules, the methods previous adopted are difficult and inefficient.

3.2 Prediction of Faults Property in Chengdu Office Area Based on Ameliorated Trapezoidal Cloud Transformation

According to the data distribution curve of attributes of known and simulated faults (the blue curve in the Fig.Ā 3), the threshold value of error is set at 0.5. It is defined as the input item, when the respective discretization of the number attribute of the 32 attributes in the table is implemented. After that, the trapezium-cloud concept of every attribute is built.

Fig.Ā 3.
figure 3

Line graph of normalĀ distribution fitting data distribution

3.3 Result Validation

In order to judge the quality of association rules, we introduced the scoring mechanism [5]: ScoreĀ =Ā 40Ā % * supportĀ +Ā 60Ā % * confidence and the higher score the better quality on simulated faults. TableĀ 1 presents the top 12 association rules used for the experiment on disposal of simulated faults.

TableĀ 1. Association rules

TableĀ 2 shows the simulation results. When the length of 40 steps (about 60Ā km) is chose as a parameter, 42444 simulated faults in each angel were covered. After successive adjustments, it is inferred that 2305 faults are in a north-east direction(35ā€“50Ā°)and most of them are in 40Ā°; 295 faults are in north-south direction; 354 faults are in a north-west direction (270Ā°ā€“360Ā°).

TableĀ 2. Fracture property and direction proportion

It is important to note that unclassified faults mean that they do not conform to any one of the 12 association rules above. The superposition of the part verified by the simulated faults and the satellite image of this area is shown in Fig.Ā 4.

Fig.Ā 4.
figure 4

SatelliteĀ image of simulated faults verified

4 Conclusion

This paper proposed the association rules reasoning model based on ameliorated trapezoidal cloud transformation, which is aimed primarily at complexity and randomness geosciences data bears. The traditional trapezoidal cloud transformation is improved in order to avoid lack of data mutation information and to finish reasonable and sensitive exchange from qualification to quantification. The attributes of simulated faults extraction algorithm was designed to overcome the limitations of traditional visualĀ interpretation to ensure the effectiveness and completeness of the test data. MLAR model was adopted to reason and predict the unknown faults and fault property in Chengdu office zone. The resultĀ supports the judgements other academics made for the faults and their attributes in probability perspective, further explainsĀ it acts better in association mining between fault types and its attribute data automatically, through which theĀ modelā€™sĀ reliabilityĀ hasĀ beenĀ testified.