dr. Willem van Hage

Willem van Hage

E-mail 
Social networks  Follow wrvhage on Twitter My Facebook profile


Netherlands eScience Center
Netherlands eScience Center
Science Park 140,
1098 XG Amsterdam
T 06-15886039

Vrije Universiteit Address
Web & Media Group < Computer Science
< Faculty of Sciences
< VU University Amsterdam
De Boelelaan 1081A,
1081 HV Amsterdam,
Room U332
T 020-5988785,
F 020-5987653


I am Technical Lead data management & analytics at the Netherlands eScience Center and guest researcher in the Web & Media group at the VU. I coordinate scientific software development and research projects in various domains (geography, climatology, law, physics, etc.) that have to do with analytics. I do research on information integration, visualization, and event modeling, focussing on improving the speed, transparency, and reproducibility of the research process, with a special interest in explorative and creative research.

My PhD thesis about ontology alignment is available here: “Evaluating Ontology-Alignment Techniques” and my MSc thesis about the border between ontology engineering and information retrieval is available here: “Living on the Edge”.

Research Linked Open Piracy, Space package, SEM

My research mainly deals with Information Integration, Visualization, and Reasoning.

Computer science topics that occupy me are: Ontology Alignment, Integrating Spatial reasoning and Semantics, Trust on the Web, Data Mining, specifically mining knowledge from the World Wide Web, The Semantic Web, Description Logic, combining Logical and Statistical reasoning, Information Retrieval & Extraction.

Some other topics that interest me are: Information Theory, Program Algebra, Logical & Functional programming languages.

Program Committees

I am co-chair of DeRiVE 2012, and DeRiVE 2011.

I am a PC member of ISWC 2013 In Use track, Cooking with Computers (CwC) 2013 at IJCAI, ESWC 2013, ISWC 2012 Evaluation track and In Use track, GiBDA 2012, AIW 2012, ISWC 2011, Web-KR 2011, LSTD 2010. ISWC 2009, ISWC 2009 Doctoral Consortium, and ESWC 2009 Posters & Demos.

I have reviewed papers for the Artificial Intelligence journal the Journal of Web Semantics, the Semantic Web Journal and Information Systems; and for the following conferences, workshops, and summer schools: EKAW 2012, IJCAI 2011, KEOD 2010, FOIS 2010, KCAP 2005-2009, ESWC 2009, ICSD 2009, ECAI 2009, SemEval 2009 workshop (at ACM 2009), ESSLLI 2006, and ASWC 2006.

SPARQL client for R

I wrote a SPARQL client for the R statistical programming language together with Tomi Kauppinen and a number of additional contributors.


Tutorials can be found at:


This client allows you to directly import results of SPARQL SELECT queries into the statistical environment of R as a data frame. That means you can directly perform statistical analysis on data sets on the web. For example, you can use the following R code to get data from the Linked Open Piracy SPARQL end point described below.

library(SPARQL)
endpoint <- "http://semanticweb.cs.vu.nl/lop/sparql/"
q <-
  "SELECT *
   WHERE {
     ?event sem:hasPlace ?place .
     ?place eez:inPiracyRegion ?region .
   }"
prefix <- c("lop","http://semanticweb.cs.vu.nl/poseidon/ns/instances/",
            "eez","http://semanticweb.cs.vu.nl/poseidon/ns/eez/")
res <- SPARQL(endpoint,q,ns=prefix)$results
pie(sort(table(res$region)),col=rainbow(12))

This yields the following overview of the world-wide distribution of piracy attacks:

Distribution of piracy attacks

With a little more effort you can plot correlations.

q <-
  "SELECT *
   WHERE {
     ?event sem:eventType ?event_type .
     ?event sem:hasPlace ?place .
     ?place eez:inPiracyRegion ?region .
   }"
res <- SPARQL(endpoint,q,ns=prefix)$results
restable <- table(res$event_type,res$region)
par(mar=c(4,10,1,1))
barplot(restable,col=rainbow(10),horiz=T,las=1,cex.names=0.8)
legend("topright",rownames(restable),
       cex=0.8,bty="n",fill=rainbow(10))

This yields the following overview of the distribution of kinds of events per region:
Distribution of piracy attacks



Using the ggmap package you can draw an overview map of the same events. The ggmap package requires you to install the mapproj package.

library(ggmap)
q <-
  "SELECT *
   WHERE {
     ?event sem:eventType ?event_type .
     ?event sem:hasPlace ?place .
     ?place wgs84:lat ?lat .
     ?place wgs84:long ?long .
   }"
res <- SPARQL(endpoint,q,ns=prefix)$results
qmap('Gulf of Aden', zoom=2, legend='bottomright') +
  geom_point(aes(x=long, y=lat, colour=event_type), data=res) +
  scale_color_manual(values = rainbow(10))

This yields the following map:

Geographical distribution of piracy attacks

Linked Open Piracy

Together with Véronique Malaisé, Marieke van Erp, I released Linked Open Piracy (LOP), a Linked Open Data version of the ICC-CCS International Maritime Bureau reports of piracy attacks world wide. This data set is linked to Princeton WordNet 2.0 and 3.0, Freebase, DBpedia, Geonames and Exclusive Economic Zone data from VLIZ.

Piracy in the Gulf of Aden from LOP
Pirate attacks in the Gulf of Aden from the Linked Open Piracy data set.

Pirate attack modeled in SEM
An example pirate attack modeled in the Simple Event Model (see below).

the SWI-Prolog space package

Together with Jan Wielemaker I am working on a Prolog library for spatial indexing called the space package. It provides fast nearest neighbor, intersection, and containment queries. The current version can deal with Points, LineStrings, and Polygons as query shapes or as data shapes. Results are “returned” one by one, non-deterministically. This makes writing combined spatial/semantic/temporal query optimizers in Prolog relatively easy. The space package also provides back and forth conversion between a number of geospatial/geosemantic standards, like: GML, KML, WKT, and RDF with GeoRSS Simple and GeoRSS GML or with WGS84 properties.

You can find more about the space package in the workshop article The Space package: Tight Integration Between Space and Semantics. The source code of the space package can be downloaded from or browsed on the GIT repository. Documentation can be found in the SWI-Prolog documentation server.

the Simple Event Model

Together with Véronique Malaisé I designed the Simple Event Model (SEM), an ontology to describe events in terms of “who did what with what to whom, where and when?”, respectively modeled as Actors, Events, Objects, Roles, and Places, which are all Time-stamped. Mappings from SEM to other models are included in the SEM RDF file.

the Simple Event Model
The Simple Event Model (SEM)

Publications Event and Behavior Modeling, Augmented Reality, Ontology Alignment

This table is sortable, click the headers.
My favorites are marked with a selection.

Highlight papers about:
Turn highlighting for all papers on | off.
title authors in year
GAF: A Grounded Annotation Framework for Events
Fokkens, A.
et al.
EVENTS 2013 (workshop)
(to be published)
2013
Knowledge Representation in Scientific Models and their Publications: a Case Study
de Vos, M. G.
van Hage, W. R.
Wielemaker, J.
Schreiber, A. Th.
K-CAP 2013 (poster)
(to be published)
2013
Transparency of Environmental Computer Models
de Vos, M. G.
van Hage, W. R.
Schreiber, A. Th.
Do-Form 2013 (workshop)
(to be published)
2013
Proceedings of the Workhop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2012)
van Erp, M.
Hollink, L.
van Hage, W. R.
Troncy, R.
Schamma, D. A.
(editors)
DeRiVE 2012
(workshop proceedings)
2012
Reconstructing Semantics of Scientific Models: a Case Study
de Vos, M. G.
van Hage, W. R.
Ros, J.
Schreiber, A. Th.
OEWD 2012 (workshop)
(to be published)
2012
Trust Evaluation through User Reputation and Provenance Analysis
Ceolin, D.
Groth, P.
van Hage, W. R.
Nottamkandath, A.
Fokkink, W.
URSW 2012 (workshop)
(to be published)
2012
selection Linked Open Piracy: A story about e-Science, Linked Data, and statistics
van Hage, W. R.
Malaisé, V.
van Erp, M.
JoDS (journal)
(to be published)
2012
MultiFarm: A Benchmark for Multilingual Ontology Matching
Meilicke, C.
et al.
JWS (journal)
(to be published)
2012
The Simple Event Model
van Hage, W. R.
Ceolin, D.
Situational Awareness with Systems of Systems (book chapter)
(to be published)
2012
Assessing trust for determining the reliability of information
Ceolin, D.
van Hage, W. R.
Schreiber, G.
Fokkink, W.
Situational Awareness with Systems of Systems (book chapter)
(to be published)
2012
Final results of the Ontology Alignment Evaluation Initiative 2011
Euzenat, J.
et al.
OM 2011
(report)
2011
Proceedings of the Workhop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011)
van Erp, M.
van Hage, W. R.
Hollink, L.
Jameson, A.
Troncy, R.
(editors)
DeRiVE 2011
(workshop proceedings)
2011
selection Estimating uncertainty of categorical Web data
Ceolin, D.
van Hage, W. R.
Fokkink, W.
Schreiber, G.
URSW 2011 (workshop)
2011
Linked Open Piracy
van Hage, W. R.
Malaisé, V.
van Erp, M.
DeRiVE 2011 (workshop)
2011
Comparing Vessel Trajectories using Geographical Domain Knowledge and Alignments
(MAD 2011 extended abstract)
(Benelearn 2011 extended abstract)
(Benelearn 2011 poster)
de Vries, G.
van Hage, W. R.
van Someren, M.
MAD 2011 (workshop)
and Benelearn 2011 (poster)
2011
selection Design and Use of the Simple Event Model (SEM)
(DOI link)
van Hage, W. R.
Malaisé, V.
Segers, R.
Hollink, L.
JWS (journal)
vol. 9 nr. 2
2011
Linked Open Piracy (poster)
see also: Linked Open Piracy data set, ICC-CCS IMB and NGA WTS piracy reports as Linked Open Data.
van Hage, W. R.
Malaisé, V.
van Erp, M.
K-CAP 2011 (poster) 2011
Parse and Tag Somali Pirates
van Erp, M.
van Hage, W. R.
Malaisé, V.
CLIN 2011
(abstract and presentation)
2011
Comparing Vessel Trajectories using Geographical Domain Knowledge and Alignments
de Vries, G.
van Hage, W. R.
van Someren, M.
SSTDM 2010 (workshop) 2010
Calculating the Trust of Event Descriptions using Provenance
Ceolin, D.
van Hage, W. R.
Groth, P.
STPM 2010 (workshop) 2010
Adaptation Step-by-Step: Challenges for Real-time Spatial Personalization (position paper)
van Hage, W. R.
Stash, N.
Wang, Y.
Aroyo, L.
PUMP 2010 (workshop) 2010
best paper EKAW 2010 in use Mobile Cultural Heritage Guide: Location-Aware Semantic Search
(best paper award in use track)
van Aart, C.
Wielinga, B.
van Hage, W. R.
EKAW 2010 (conference) 2010
selection Abstracting and Reasoning over Ship Trajectories and Web Data with the Simple Event Model (SEM)
(extended version of ACM EiMM09)
(DOI link)
van Hage, W. R.
Malaisé, V.
de Vries, G.
Schreiber, A. Th.
van Someren, M.
MTAP (journal)
online SpringerLink
2011
(january)
An Integrated Approach for Visual Analysis of a Multi-Source Moving Objects Knowledge Base
(extended version)
(DOI link)
Willems, N.
van Hage, W. R.
de Vries, G.
Janssens, J.
Malaisé, V.
IJGIS (journal)
vol. 24, nr. 10
2010
(october)
Finding Your Way through the Rijksmuseum with an Adaptive Mobile Museum Guide
van Hage, W. R.
Stash, N.
Wang, Y.
Aroyo, L.
ESWC 2010 (conference) 2010
A Trust Model to Estimate Quality of Annotations using the Web
Ceolin, D.
van Hage, W. R.
Fokkink, W.
Web Science 2010 (conference) 2010
An Integrated Approach for Visual Analysis of a Multi-Source Moving Objects Knowledge Base
Willems, C. M. E.
van Hage, W. R.
de Vries, G.
Janssens. J.
Malaisé, V.
AGILE 2010 GeoVA(t) (workshop) 2010
selection The Space package: Tight Integration Between Space and Semantics
(extended version)
(DOI link)
van Hage, W. R.
Wielemaker, J.
Schreiber, A. Th.
Transactions in GIS (journal)
vol. 14 nr. 2
2010
selection The OAEI food task: an analysis of a thesaurus alignment task
(DOI link)
van Hage, W. R.
Sini, M.
Finch, L.
Kolb, H. P.
Schreiber, A. Th.
Applied Ontology (journal)
vol. 5, nr. 1
2010
The Space package: Tight Integration Between Space and Semantics
van Hage, W. R.
Wielemaker, J.
Schreiber, A. Th.
TerraCognita 2009 (workshop) 2009
Spatial and Semantic Reasoning to Recognize Ship Behavior (draft)
van Hage, W. R.
Malaisé, V.
de Vries, G.
Schreiber, A. Th.
van Someren, M.
ISWC 2009 (demo & poster) 2009
Combining Ship Trajectories and Semantics with the Simple Event Model (SEM)
van Hage, W. R.
Malaisé, V.
de Vries, G.
Schreiber, A. Th.
van Someren, M.
ACM EiMM09 (workshop) 2009
Structuring and extracting knowledge for the support of hypothesis generation in molecular biology
(DOI link)
Roos, M.
et al.
BMC Bioinformatics (journal)
vol. 10, nr. 10
2009 (october)
Semantic disclosure in an e-Science environment
Marshall, M. S.
et al.
Semantic e-Science (book chapter)
chapter 2
Springer AoIS
2009
selection Evaluating Ontology-Alignment Techniques
van Hage, W. R.
PhD thesis 2008
Biological applications of AIDA knowledge management components (poster abstract)
Roos, M.
et al.
ISMB 2008 (poster) 2008
Relevance-based Evaluation of Alignment Approaches: The OAEI 2007 Food Task Revisited
(poster version)
van Hage, W. R.
Kolb, H. P.
Schreiber, A. Th.
OM 2008 (workshop) 2008
Comparing human and automatic thesaurus mapping approaches in the agricultural domain
Lauser, B.
Johannsen, G.
Caracciolo, C.
Keizer, J.
van Hage, W. R.
Mayr, P.
DC 2008 (conference) 2008
De AIDA toolbox: Een gecombineerde aanpak voor het beheren van kennis
Marshall, M. S.
Roos, M.
Meij, E.
Katrenko, S.
van Hage, W. R.
Adriaans, P. W.
Agroinformatica (journal, short)
vol. 21, nr. 4
2008
My first BioAID: heuristic support for hypothesis construction
(conference paper unavailable)

(poster abstract)
Roos, M.
Katrenko, S.
van Hage, W. R.
Meij, E.
Marshall, M. S.
Adriaans, P. W.
ISMB-ECCB 2007 (conference) 2007
Sample Evaluation of Ontology-Matching Systems
van Hage, W. R.
Isaac, A.
Aleksovski, Z.
EON 2007 (workshop) 2007
A Survey and Categorization of Ontology-Matching Use Cases
(poster version)
Aleksovski, Z.
van Hage, W. R.
Isaac, A.
OM 2007 (workshop) 2007
Results of the Ontology Alignment Evaluation Initiative 2007
Euzenat, J.
et al.
OM 2007 (report) 2007
D2.2.9: Description of alignment evaluation and benchmarking results
Shvaiko, P.
et al.
Deliverable of the Knowledge Web working group on heterogeneity (2.2) 2007
UVAVU: WordNet Similarity and Lexical Patterns for Semantic Relation Classification
(poster version)
van Hage, W. R.
Katrenko, S.
SemEval-2007 (workshop) 2007
Results of the Ontology Alignment Evaluation Initiative 2006
Euzenat, J.
et al.
OM 2006 (report) 2006
selection A Method for Learning Part-Whole Relations
van Hage, W. R.
Kolb, H. P.
Schreiber, A. Th.
ISWC 2006 (conference) 2006
D2.2.4: Alignment implementation and benchmarking results
Stuckenschmidt, H.
et al.
Deliverable of the Knowledge Web working group on heterogeneity (2.2) 2006
selection A Method to Combine Linguistic Ontology-Mapping Techniques
van Hage, W. R.
Katrenko, S.
Schreiber, A. Th.
ISWC 2005 (conference) 2005
Information Retrieval Support for Ontology Construction and Use
van Hage, W. R.
de Rijke, M.
Marx, M.
ISWC 2004 (conference) 2004
“Living on the edge”, Combining structured & semi-structured data for information retrieval
van Hage, W. R.
MSc thesis 2004
The University of Amsterdam at CLEF 2004
Fissaha, S.
van Hage, W. R.
Kamps, J.
Lacerda de Melo, G.
de Rijke, M.
CLEF 2004 (conference) 2004
Towards a Topic Driven Access to Full Text Documents
Caracciolo, C.
van Hage, W. R.
de Rijke, M.
ECDL 2004 (conference) 2004


Teaching Information Retrieval, Ontology Engineering, Web Science minor, MSc projects

I gave the following courses at the VU University Amsterdam at the Faculty of Sciences:

  • Information Retrieval for BSc students in Computer Science and IMM. This course is about topics such as crawling, indexing, various term weighting schemes, and evaluation. Lab exercises involve distributed search on a Hadoop cluster with Katta and Pig Latin on Common Crawl and Wikipedia.
  • Knowledge and Media for MSc students in Informatics, Information Sciences, and Artificial Intelligence. This course deals with modeling decisions in knowledge intensive applications that involve multimedia.
  • Ontology Engineering for MSc students in Informatics, Information Sciences, and Artificial Intelligence. This course treats topics like modeling decisions, ontological commitment, and design patterns in RDF(S), OWL, UML.
  • Computer Science Seminar for MSc students in Computer Science. This is a course about literature study in computer science. It teaches basic scientific skills for critical reading, analyzing, summarizing, comparing, presenting, and debating about scientific articles.

I gave guest lectures for the follwing courses at the VU University Amsterdam at the Faculty of Social Sciences:

  • Marketing and persuasive communication for BSc students communication sciences. A lecture dealing with term weighting, PageRank, and Search Engine Optimization.


I gave guest lectures for the following courses at the Technical University Delft:

  • Information Retrieval for MSc students in Computer Science. A guest lecture about Latent Semantic Indexing.


I gave guest lectures for the following courses at the Universiteit van Amsterdam:

  • Research Methods in AI for BSc students in Artificial Intelligence. A guest lecture about the ideas behind modeling in general and specifically with respect to ontologies, criteria to assess the quality of models, and guidelines for good modeling.
  • Intelligent Interactive Systems for MSc students in Information Sciences. A guest lecture giving an overview of localization: from methods for indoor and outdoor geodesy through encoding and indexing to knowledge intensive location-based applications.

I coordinated the design of the Web Science minor at the VU University Amsterdam's Faculty of Sciences.


I gave a short practical business course on Information Extraction from the Web and Semantic Web techniques using SWI-Prolog in 2011.


I supervise the following PhD students: Davide Ceolin, Jesper Hoeksema.


I supervised the following PhD students: Valentina Maccatrozzo, Martine de Vos


I supervised the following MSc students: Tobin Hupsel, Sander Koers, Aäron Jansen, Juan Manuel Coleto Gómez, Vincent Osinga, Bas van Dijk, Thomas Günther, Wiard Vasen, Vikram Ghotgalkar.


I supervised the following BSc students: Niek Siekman, Renske Augustijn.


I was second reader of the theses of the following MSc students: Lieke Verhelst (TUD), Balthasar Schopman, Jevon van Dijk, Andrea Bravo Balado.

Projects and Networks NewsReader, COMMIT Metis, Poseidon, DeRiVE, OAEI

Personal Karate, etc.

Martial Arts

Currently I practice shotokan karate (松濤館流 空手) at the SKCA with many teachers amongst which Tom Kompier and Bregje Kaars-Sijpesteijn, where I am a first-degree black belt (shodan, 初段 空手). I am a first-degree black belt jūjutsu (shodan, 初段 柔術) with the JBN. I used to practice at the dojo of the USC in Amsterdam with Edwin Hiense (JBN). I am a green belt judo (third kyū, 三級 柔道) with Eric Bos (JBN) when he still taught at the USC.



Kata Guruma
me performing a  kata guruma throw
on/with Karien van Assendelft

Bass guitar


... except mine is fretless


... except mine has a black machine head


I play bass guitar in the (for lack of a better name for the genre) dutch progressive-death-metal-reggae-funk-rock band Gemeengoed.


Computer Generated Art

Below is a gallery of computer generated pictures I made between 1997 and 2001. They were all made using Adobe Photoshop's “Filters” menu and nothing else. (i.e. no paint brushes etc.)

Paradox of Choice

I agree with Barry Schwartz: More choice in our everyday life leads to less happiness.


Barry Schwartz

Free will without non-determinism

I find Daniel Dennett's view on free will is very convincing.


Daniel Dennet