Tag Archives: Scholarly data

Research Assistant position in our research group (KMi, The Open University)

Our research group at KMi, The Open University, is searching for a research assistant to work on the analysis of Big Scholarly Data in the context of Rexplore, a system which provides an innovative environment for exploring and making sense of scholarly data.

We are offering a paid internship for an initial period of 6 months, with the possibility for renewal. We believe it is a good opportunity for an undergrad or a master student to be part of a high-profile research team, under the supervision of Enrico Motta, Professor of Knowledge Technologies at the Open University. They would also have the opportunity of collaborate with major international publishers such as Elsevier and Springer Nature.

Specific tasks of the role are as follows:
– Integration and management of Big Data in the academic domain;
– Developing and testing the Rexplore technology as a service to be used by publishers and universities worldwide;
– Contributing to the creation of innovation algorithms to extract information from research data and to forecast the flow of knowledge in the research domain.

I would be grateful if you could flag this position to friend of yours who may be interested in this opportunity.

For further information please write to francesco.osborne@open.ac.uk.

 

SAVE-SD 2017: a meeting point for the Scholarly Data Community

I am organising with Alejandra Gonzalez-Beltran, Silvio Peroni and Sahar Vahdati the third edition of the Semantics, Analytics, Visualisation: Enhancing Scholarly Data workshop (SAVE-SD 2017). The SAVE-SD workshop aims to bring together publishers, companies and researchers from different fields to bridge the gap between the theoretical and practical aspects in regards to scholarly data. It was the first workshops to experiment with RASH, the novel HTML-based format that permits to embed semantic annotations within a paper and is now accepted by the main Semantic Web conferences.

SAVE-SD 17 is co-located with WWW 2017 and will take place April 3 2016 in Perth, Australia. The submission deadline is January 31, 2016.

More information on SAVE-SD 2017 are available at http://cs.unibo.it/save-sd/2017/index.html.

Talking about Ontology Forecasting and Technology Extraction at EKAW 2016

Last week I presented two research papers at the 20th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2016) in Bologna, Italy.

The first one introduces TechMiner, a novel tool which combines NLP, machine learning and semantic technologies, for mining technologies from research publications and generating an OWL ontology describing their relationships with other research entities. The resulting knowledge base can support a number of tasks, such as: richer semantic search, richer expert search, monitoring the emergence and impact of new technologies, studying the scholarly dynamics associated with the emergence of new technologies, and others.

The second paper deal with the novel task of ontology forecasting and introduces the Semantic Innovation Forecast (SIF) model, which predicts the concepts that will enrich an ontology in the future. Indeed, ontologies representing scientific disciplines contain only the research topics that are already popular enough to be selected by human experts or automatic algorithms. They are thus unfit to support tasks which require the ability of describing and exploring the forefront of research, such as trend detection and horizon scanning. SIF instead allows to forecast future ontologies by analysing lexical innovation and adoption information extracted from historical data.

The papers presented at EKAW 2016 are the following:

A new solution for classifying scholarly publications: Smart Topic Miner

The process of classifying scholarly outputs is crucial to ensure timely access to knowledge. This process is typically carried out manually by expert editors, leading to high costs and slow throughput. For these reasons, the Rexplore team, in collaboration with Springer Nature, created Smart Topic Miner (STM), a novel solution which uses semantic web technologies to classify scholarly publications on the basis of a very large automatically generated ontology of research areas.

STM was developed to support the Springer Nature Computer Science editorial team in classifying proceedings in the LNCS family, consisting in about 800 proceedings books each year. It analyses in real time a set of publications provided by an editor and produces a structured set of topics and a number of Springer Nature Classification tags, which best characterise the proceedings book. Differently from other applications which characterize a text with topics, STM produces a full taxonomy of the relevant research areas rather than a flat list of keywords or categories. This helps editors and users to understand the context of each topic and its relationships with other research areas.

You can try a public demo of STM at http://rexplore.kmi.open.ac.uk/STM_demo/

Relevant paper:

See you at ISWC 2015

I will be at ISWC 2015 October 11-15 to present the paper “Klink-2: Integrating Multiple Web Sources to Generate Semantic Topic Networks” about the automatic generation of large-scale ontologies of research topics. I will introduce Klink-2, a novel approach which analyses networks of research entities (including papers, authors, venues, and technologies) to infer three kinds of semantic relationships between topics. It also identifies ambiguous keywords (e.g., “ontology”) and separates them into the appropriate distinct topics – e.g., “ontology/philosophy” vs. “ontology/semantic web”. I am using this approach in Rexplore to foster a number of research analytics.

I will also present a poster/demo about the RASH Framework, a set of specifications and writing/conversion/extraction tools for writing academic articles in RASH (Research Articles in Simplified HTML), a HTML-based format that permits to embed RDFa annotations and Turtle statements within a document. This format was adopted already by a number of workshops this year at conferences such as ESWC, ISWC and WWW, and it is spreading quickly and raising the interest of a number of editors and conference organisers.

See you soon in Bethlehem, Pennsylvania!