The Quest for Research Information

Ina Blümel, Stefan Dietze, Lambert Heller, Robert Jäschke, Martin Mehlberg
2014 Procedia Computer Science  
Research information, i.e., data about research projects, organisations, researchers or research outputs such as publications or patents, is spread across the web, usually residing in institutional and personal web pages or in semi-open databases and information systems. While there exists a wealth of unstructured information, structured data is limited and often exposed following proprietary or less-established schemas and interfaces. Therefore, a holistic and consistent view on research
more » ... ation across organisational and national boundaries is not feasible. On the other hand, web crawling and information extraction techniques have matured throughout the last decade, allowing for automated approaches of harvesting, extracting and consolidating research information into a more coherent knowledge graph. In this work, we give an overview of the current state of the art in research information sharing on the web and present initial ideas towards a more holistic approach for boot-strapping research information from available web sources.
doi:10.1016/j.procs.2014.06.040 fatcat:x7qpwy6gsrdeldhcn2fppd6ryy