Hi
We are starting to investigate the ROR dataset, with a view to integrating it with our ECRIN metadata repository system for clinical research. Our data includes academic, healthcare, commercial and governmental organisations that act as sponsors and funders of clinical research studies, that publish and manage data objects, and that appear in the affiliations of individuals.
(Info on ECRIN can be found at Facilitating European Clinical Research | ECRIN, and on the metadata repository at ECRIN-MDR Wiki (ecrin-mdr.online). The portal itself, still under development, is at ECRIN | MetaData Repository Web portal (crmdr.org).)
The process will likely take a few months, but the first step has been to download the ROR data and load it into a database so that we can examine it in more detail (so this post may be 1 of n). If anyone is interested, the code for doing that is at:
ecrin-github/ror_data: Downloads ROR dataset and loads to Postgres tables
It is a small .NET Core C# console app that splits the data and stores it in PostgreSQL tables; further information is in the README file. It works with the current ROR dataset, including the added relationship and location data. I hope it is useful to others, and I am happy to share thoughts with anyone else doing similar work.
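As a rough illustration of what "splitting" the data means here, the sketch below (in Python, and not the code in the repository) flattens nested organisation records into rows for two tables. It assumes each record is a JSON object with `id`, `name` and `status` fields and a `relationships` list whose items carry `type`, `label` and `id`. The sample records, their identifiers and the two-table layout are made up for the example; the actual table design is described in the README.

```python
import json

def split_records(records):
    """Flatten nested organisation records into rows for two tables.

    Field names are assumptions about the record layout, not taken
    from the ror_data repository.
    """
    org_rows = []   # one row per organisation
    rel_rows = []   # one row per relationship between organisations
    for rec in records:
        ror_id = rec["id"]
        org_rows.append((ror_id, rec.get("name"), rec.get("status")))
        for rel in rec.get("relationships", []):
            rel_rows.append((ror_id, rel.get("type"), rel.get("id"), rel.get("label")))
    return org_rows, rel_rows

# Made-up sample data: identifiers and names are illustrative only.
sample = json.loads("""
[
  {"id": "https://ror.org/example-a", "name": "Example University", "status": "active",
   "relationships": [
     {"type": "Child", "label": "Example Hospital", "id": "https://ror.org/example-b"}
   ]},
  {"id": "https://ror.org/example-b", "name": "Example Hospital", "status": "active",
   "relationships": [
     {"type": "Parent", "label": "Example University", "id": "https://ror.org/example-a"}
   ]}
]
""")

org_rows, rel_rows = split_records(sample)
```

Each list of tuples maps onto one table, so the rows could then be bulk-inserted with whichever database client is in use.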
Cheers
Steve