Data Commons Platform
Data Extraction & Entity Matching Tooling
OS-C’s focus is factors identified by asset owners, asset managers, banks, and regulators as priorities from among SASB, TCFD, CDSB, GRI, and CDP as highly material.
Data Extraction & Entity Matching
Data Extraction Workstream
OS-Climate uses Natural Language Processing (NLP) to extract key metrics from ESG and sustainability reports. We are building AI/ML-based and rules-based engines to extract critical data from non-standardized reports to support our various analytics tools.
Entity Matching
OS-Climate has built tools which provide a standardized way to clean up data to enable legal entity matching and related tasks. The tools provide additional insights into corporate hierarchy and subsidiaries. Legal terms are preserved and translated to the entity’s country of reference. We can match many sources against trustable referential data, so to build a more robust referential.
Watch Videos about Data Commons
Data Commons Architecture and Infrastructure Demo
Speakers: Vincent Caldeira and Erik Erlandson from Red Hat
Summary: Obtain details on the Data Commons components including the data management architecture. Learn how we manage data like code, turning a “data mess” into a “data mesh”.
Data Sources and Linkages Across Datasets (Bonus Video)
Speakers: Michael Tiemann from Red Hat
Summary: Learn more about the data available in the Data Commons, along with how different datasets can be combined to drive insights into climate-smart investing.
Extraction/Transformation of Corporate Data from structured and unstructured sources
Speakers: Lea Deleris from BNP Paribas, Ismail Demir from Allianz IDS GmbH, Jeremy Goh from BNP Paribas, Karan Chauhan from Red Hat, Christian Meyndt
Summary: Learn how users can leverage OS-C’s NLP toolkit to extract key climate data/metrics from unstructured reports like ESG and annual reports.