On Monday, December 2 at 1.30 pm in Room P-702 (Paulinum), Mohamed Morsey will give a final rehearsal for his PhD defense “Efficient Extraction and Query Benchmarking of Wikipedia Data”. Guests are encouraged to both provide feedback about improvements to the talk and ask preparatory questions.
As always, Bachelor and Master students are able to get points for attendance and there is complimentary coffee and cake after the session.
Efficient Extraction and Query Benchmarking of Wikipedia Data
The thesis consists of two major parts:
- Semantic Data Extraction: the objective of that part is to extract data from semi-structured source, i.e. Wikipedia, and transform it into a networked knowledge base, i.e. DBpedia. Furthermore, maintaining the up-to-dateness of that knowledge base to be always in synchronization with Wikipedia.
- Triplestore Performance Evaluation: normally the semantic data is stored on a triplestore, e.g. Virtuoso, in order to enable the efficient querying of that data. In that part we have developed a new benchmark for evaluating and contrasting the performance of various triplestores.