My organization provides a basic data platform where journalists can upload and share static data (typically CSVs). We’ve begun exploring DCAT and vocabularies such as Data Cube as a way to build a knowledge graph and improve the discoverability and interoperability of data on the platform. We’re in the earliest research stages and I was hoping folks here could provide us with a sanity check and some guidance on a few basic questions related to cataloguing.
Specifically, there are some concepts built into our platform that don’t seem to be reflected in DCAT (even the latest v3). For example, our top-level concept is a Project, which typically includes one or more Datasets and related Distributions. Projects also typically have related resources such as READMEs, training videos, and other supplementary material that help users understand how to interpret and work with Datasets contained in a project. These related resources also do not seem to have clear corollaries in DCAT. I’m wondering how more experienced folks on this list would handle this situation. Would you recommend creating a DCAT Profile that extends the base vocabulary? Or perhaps there’s an existing DCAT profile or an entirely different vocabulary that covers these concepts?
We’re new to linked data in general, so any and all advice is greatly appreciated!