Large and growing data resources on the spatial and temporal diversity and distribution of the more than 400 carbon-bearing mineral species reveal patterns of mineral evolution and ecology. Recent advances in analytical and visualization techniques leverage these data and are propelling mineralogy from a largely descriptive field into one of prediction within complex, integrated, multidimensional systems. These discoveries include: (1) systematic changes in the character of carbon minerals and their networks of coexisting species through deep time; (2) improved statistical predictions of the number and types of carbon minerals that occur on Earth but are yet to be discovered and described; and (3) a range of proposed and ongoing studies related to the quantification of network structures and trends, relation of mineral “natural kinds” to their genetic environments, prediction of the location of mineral species across the globe, examination of the tectonic drivers of mineralization through deep time, quantification of preservational and sampling bias in the mineralogical record, and characterization of feedback relationships between minerals and geochemical environments with microbial populations. These aspects of Earth’s carbon mineralogy underscore the complex co-evolution of the geosphere and biosphere and highlight the possibility for scientific discovery in Earth and planetary systems.
Abstract Minerals contain important clues to understanding the complex geologic history of Earth and other planetary bodies. Therefore, geologists have been collecting mineral samples and compiling data about these samples for centuries. These data have been used to better understand the movement of continental plates, the oxidation of Earth's atmosphere and the water regime of ancient martian landscapes. Datasets found at ‘RRUFF.info/Evolution’ and ‘mindat.org’ have documented a wealth of mineral occurrences around the world. One of the main goals in geoinformatics has been to facilitate discovery by creating and merging datasets from various scientific fields and using statistical methods and visualization tools to inspire and test hypotheses applicable to modelling Earth's past environments. To help achieve this goal, we have compiled physical, chemical and geological properties of minerals and linked them to the above‐mentioned mineral occurrence datasets. As a part of the Deep Time Data Infrastructure, funded by the W.M. Keck Foundation, with significant support from the Deep Carbon Observatory (DCO) and the A.P. Sloan Foundation, GEMI (‘Global Earth Mineral Inventory’) was developed from the need of researchers to have all of the required mineral data visible in a single portal, connected by a robust, yet easy to understand schema. Our data legacy integrates these resources into a digestible format for exploration and analysis and has allowed researchers to gain valuable insights from mineralogical data. GEMI can be considered a network, with every node representing some feature of the datasets, for example, a node can represent geological parameters like colour, hardness or lustre. Exploring subnetworks gives the researcher a specific view of the data required for the task at hand. GEMI is accessible through the DCO Data Portal ( https://dx.deepcarbon.net/11121/6200‐6954‐6634‐8243‐CC ). We describe our efforts in compiling GEMI, the Data Policies for usage and sharing, and the evaluation metrics for this data legacy.