Summary
The analysis of large data sets in meteorological and air quality studies is often made though the examination of specific case studies, especially when time-consuming computational models are employed. This paper presents the development of a tool to identify specific case studies, termed as representative days, that would subsequently be modelled. The success of such tools should be judged on the discrimination between the specified cases: and the degree to which they capture and recreate historical characteristics of the original data set. The developed approach utilises a principal component algorithm with varimax rotation (r-PCA) and the subtractive clustering algorithm coupled with a cluster validity criterion. In this paper, the developed tool is applied to a data set from the North Sea, utilizing two years worth of data from the DNMI operational forecasting model. The results will be subsequently used in photochemical and radiative forcing modelling tools as part of the EC funded project AEOLOS, with the ultimate goal to estimate the global warming potential of non-radioactive tracing substances such as SF6 and PFCs, which are heavily used in the oil industry.
Similar content being viewed by others
References
RA Bejaran IA Camilloni (2003) ArticleTitleObjective method for classifying air masses: an application to the analysis of Buenos Aires’s urban heat island intensity. Theor Appl Climatol 74 93–103 Occurrence Handle10.1007/s00704-002-0714-4
SL Chiu (1994) ArticleTitleFuzzy model identification based on cluster estimation. J Intel Fuzzy Syst 2 267–278 Occurrence Handle10.1109/91.324806
AKA El-Kadi PA Smithson (1992) ArticleTitleAtmospheric classifications and synoptic climatology. Progr Phys Geog 16 432–455
JS Greene LS Kalkstein H Ye K Smoyer (1999) ArticleTitleRelationships between synoptic climatology and atmospheric pollution at 4 US cities. Theor Appl Climatol 62 163–174 Occurrence Handle10.1007/s007040050081
F Grubbs (1969) ArticleTitleProcedures for detecting outlying observations in samples. Technometrics 11 IssueID1 1–21
HF Kaiser (1958) ArticleTitleThe varimax criterion for analytic rotation in factor analysis. Psychometrika 23 187–200
LS Kalkstein G Tan JA Skindlov (1987) ArticleTitleA comparison of three clustering procedures for use in synoptic climatological classification. J Climate Appl Meteor 19 717–730 Occurrence Handle10.1175/1520-0450(1987)026<0717:AEOTCP>2.0.CO;2
PA Kassomenos HA Flocas S Lykoudis M Petrakis (1998) ArticleTitleAnalysis of mesoscale patterns in relation to synoptic conditions over an urban Mediterranean basin. Theor Appl Climatol 59 215–229 Occurrence Handle10.1007/s007040050025
DJ Kim YW Park DJ Park (2001) ArticleTitleA novel validity index for determination of the optimal number of clusters. IEICE T Inf Syst, E84-D 281–285
FL Ludwig JY Jiang J Chen (1995) ArticleTitleClassification of ozone and weather patterns associated with high ozone concentrations in the San Francisco and Monterey Bay areas. Atmos Environ 29 2915–2928 Occurrence Handle10.1016/1352-2310(95)00091-C
V Moron G Plaut (2003) ArticleTitleThe impact of El Niño-southern oscillation upon weather regimes over Europe and the North Atlantic during boreal winter. Int J Climatol 23 363–379 Occurrence Handle10.1002/joc.890
R Romero G Sumner C Ramis A Genoves (1999) ArticleTitleA classification of the atmospheric circulation patterns producing significant daily rainfall in the Spanish Mediterranean area. Int J Climatol 19 765–785 Occurrence Handle10.1002/(SICI)1097-0088(19990615)19:7<765::AID-JOC388>3.0.CO;2-T
Preisendorfer RW, Mobley CD (1988) Principal components analysis in meteorology and oceanography. Amsterdam: Elsevier
C Serra G Fernadez-Mills MC Periago X Lana (1999) ArticleTitleWinter synoptic types in Catalonia and their linkage with minimum temperature anomalies. Int J Climatol 19 1675–1695 Occurrence Handle10.1002/(SICI)1097-0088(199912)19:15<1675::AID-JOC440>3.3.CO;2-X
M Shahgedanova TP Burt TD Davies (1998) ArticleTitleSynoptic climatology of air pollution in Moscow. Theor Appl Climatol 61 85–102 Occurrence Handle10.1007/s007040050054
S Sheridan (2002) ArticleTitleThe redevelopment of a weather-type classification scheme for North America. Int J Climatol 22 51–68 Occurrence Handle10.1002/joc.709
Sorensen JH, Rasmussen A (1997) Mixing height derived from the DMI-HIRLAM NWP model and used for ETEX dispersion modelling. Proc EURASAP Workshop, Riso-R-997
G Sumner (1996) ArticleTitleDaily precipitation patterns over Wales: towards a detailed precipitation climatology. T I Brit Geog 21 157–176
T Tirabassi S Nassetti (1999) ArticleTitleThe representative day. Atmos Environ 33 2427–2434 Occurrence Handle10.1016/S1352-2310(98)00371-9
B Yarnal AC Comrie BJ Frakes DP Brown (2001) ArticleTitleDevelopments and prospects in synoptic climatology. Int J Climatol 21 1923–1950 Occurrence Handle10.1002/joc.675
WMO (2001) Aerodrome reports and forecasts: A user’s handbook to the codes. WMO Publication
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Sfetsos, A., Vlachogiannis, D., Gounaris, N. et al. On the identification of representative samples from large data sets, with application to synoptic climatology. Theor. Appl. Climatol. 82, 177–182 (2005). https://doi.org/10.1007/s00704-005-0128-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-005-0128-1