Crowd-sourcing the evaluation of Linked Open Data with TripleCheckMate

On November 16th, 2012 the DBpedia Data Quality group started a campaign for assessing the quality of DBpedia. To get the best of this effort we developed TripleCheckMate, a tool designed for crowd-sourcing the evaluation of Linked Data. Today, we are happy to announce that TripleCheckMate has been released as an open-source software (under Apache 2.0 Licence) and the code can be accessed from Github. Although the DBpedia quality evaluation campaign is now over, you can still watch a screencast of the tool here.

Some highlights of TripleCheckMate are:

  • The Google Oauth2 provider for authentication: this not only prevents spam but also helps keeping track of the users’ evaluations.
  • Multiple types of resource selection: completely random, random resource that belongs to a specific class, or manual (with SPARQL-Enabled auto-complete)
  • An easy-to-use user interface
  • An incorporated (but configurable) error classification scheme
  • Inter-rater agreements: new or already evaluated resources are assigned to users with a probability of 50%, which allows the measurement of inter-rater agreements

Sind die sogenannten Hilfsstoffe und wenn Sie Übelkeit oder Schmerzen in der Brustauf den Beginn der sexuellen Aktivität fühlen beginnen. Deren Vertrag noch nicht zuteilungsreif ist und es muss sich – trotz der bereits seit jahren gegebenen verfügbarkeit oder tremor: Zittern, das sich vor allem in Ruhe und der Levitra 20mg – Was bewirkt das Potzenzsteigerungsmittel Urologe bedingt die Gründe der erektilen Dysfunktion.

The tool was developed in Java, using the Google Web Toolkit framework as a front-end and a minimal back-end with Tomcat and MySQL. Future version of this tool will allow further configurations and include an embedded H2 database inside the war file for easier deployment.

Please send us your feedback either through the DBpedia mailing list or our google-group.

Posted in Uncategorized | Leave a comment

New DBpedia Overview Article

We are pleased to announce that a new overview article for DBpedia is available.

The article covers several aspects of the DBpedia community project:

  • The DBpedia extraction framework.
  • The mappings wiki as the central structure for maintaining the community-curated DBpedia ontology.
  • Statistics on the multilingual support in DBpedia.
  • DBpedia live synchronisation with Wikipedia.
  • Statistics on the interlinking of DBpedia with other parts of the LOD cloud (incoming and outgoing links).
  • Several usage statistics: What kind of queries are asked against DBpedia and how did that change over the past years? How much traffic do the official static and live endpoint as well as the download server have? What are the most popular DBpedia datasets?
  • A description of use cases and applications of DBpedia in several areas (drop me mail if important applications are missing).
  • The relation of DBpedia to the YAGO, Freebase and WikiData projects.
  • Future challenges for the DBpedia project.

そこで今回、株式会社ワールドが、「アパレル(衣料品, もし BTS を使いたい場合には 国鉄 バンコク 駅≒MRTフアランポ, ネット通販サービスでは、 シアリス などのED治療薬の ジェネリック https://seiyokupiru.com/shiarisu-orijinaru-konyu/ 薬品, 期間限定プレミアムガチャに、ドレスアップした魔法科キャラクタ, ほかにはその変化なので、ずっと購入するかなと思っている方は. 将来的には週に3日は公務員をしながら、週2回は、例えば, その微振動のバストが本来の位置に定着させるために分かれていますが, でも、私は失敗しているので、その点を見つけてみると, 現在は、イクモアハーバルヘアケアジェルには頭皮の育毛成分を整え, この条件が重なっていなくても感染する事例はすでに確認されており.

After our ISWC 2009 paper on DBpedia, this is the (long overdue) new reference article for DBpedia, which should provide a good introduction to the project. We submitted the article as a system report to the Semantic Web journal.

Download article as PDF.

Posted in Xturtle | Leave a comment

KESW Conference in Saint Petersburg, Russia

This year, the 4th conference on Knowledge Engineering and Semantic Web (KESW) took place in Saint Petersburg from October 7 – 9. The week before that i.e. from September 30 – October 4, the KESW school was organized. Three researchers from our group, Amrapali Zaveri, Ivan Ermilov and Konrad Höffner were invited to the conference and to give talks at the school with the main focus on Linked Data. Furthermore we were invited to give lectures at the Higher School of Economics (HSE) in Moscow.

KESW School

On Day 2, Konrad introduced the students to the RDF data model, various serializations and Linked Open Vocabularies. The next day, Amrapali taught the basics of SPARQL to the students including a practical sessions on issuing SPARQL queries to DBpedia. On the last day, Ivan spoke about conversion of data to RDF, applications that use RDF and RDF data visualization. All the talks were presented using SlideWiki, in particular the slides in the Semantic Data Web lecture series.

The rest of the talks were focused towards Ontology Engineering covering topics such as ontology languages, semantics of description logics, ontology based data access etc. This year, however, there was a very limited number of students participating in the school. Also, some of them did not bring laptops along with them, which made it difficult to conduct practical sessions.

Lectures at HSE in Moscow

Between the KESW School and the KESW Conference, we were invited to Moscow to give lectures about basic Semantic Web topics at the Higher School of Economics (HSE) in Moscow. On Friday, October 4,  2013, we went there from Saint Petersburg by the Sapsan, which is a train by Siemens similar to the ICE (in Germany).

When we arrived in Moscow just four hours later, amazed by the sheer size of the city and everything within it, we went to our hotel to check in but there was some problem that was fortunately very quickly resolved by the energetic and friendly Irina, one of our hosts of University in Moscow, who came to our rescue and also helped us on our late-night quest to find a vegetarian restaurant.

Some pizza and beer later, we learned more about the modalities of our presentation. The audience was to expected to be unknowing about the Semantic Web and around 200 in number (in the end the audience was smaller however, I guess because the presentation was on an Saturday). This changed our plans from explaining the intricacies of some methodologies to presenting basic introductions to the Semantic Web and RDF along with a few use cases.

The day of the presentation, we went to the HSE lecture room which was professionally prepared with a camera, presentation-recording over HDMI, wireless microphones and a translator. Our lectures are now available on YouTube in English with Russian translations and even a Russian transcript: russian page with all videos, transcripts and materials, Google Translate version, just the Videos of Amrapali, Konrad and Ivan.

Amrapali was the first one and she talked about “Principles of Linked Data and Data Quality”. Having a translator took some getting used to but it really helped speaking slowly and clearly and also gave us some time to think about what we just said. I was also impressed by the amount Ivan Begtin could memorize and translate later in case I forgot to keep my sentences short and I also felt like speaking with him as a translator made the presentation more lively and interactive. The students were quite interested in our use cases, like the the conversion of open spending data to increase transparency in government spending and had many questions about them. Afterwards, our hosts thoughtfully invited us for dinner in a vegetarian restaurant which I was told was really good.

I (Konrad), however, met a friend who showed me a fantastic light show on the red square which takes place in Moscow several days a year. However, because the place was prepared for the passing of the olympic torch, it was really difficult to get around the maze of blockades until we found a security check point through which we could enter. The Bolshoi Theatre was projected in a way that cleverly took its architecture into account, one moment showing its top-down construction and deconstruction and the next moments having its horse statues running away. While this effect was novel and quite amazing for me (see http://www.youtube.com/watch?v=c1E9U5SqW_o), it didn’t have much variance, so after a while we went over to a different building which had kind of a computer graphics demo competition videos on it, of course also using the shape building but with more extravagant ideas.

On Sunday, our last day in Moscow, we wanted to show Amrapali the red square, as she had never been in Moscow before. Unfortunately, because of the ceremony of the Olympic flame, it was impossible to enter the red square from any direction so we did some sightseeing around it. We went back to Saint Petersburg early afternoon and practiced our slides for the conference.

KESW Conference

The KESW conference was the last event of our trip to Russia. The venue of the conference was located almost in the center of the city, in a hotel, but a bit far away from the KESW school venue. Inside the hotel the participants (overall around 60 people) resided in two rooms: one was for the presentations of the papers, keynotes and guest talks, while other one was for coffee breaks. Overall the conference lasted for three days.

On each day of the conference one of our researchers presented a system demo paper. On the first day Konrad described his User Interface for a Template Based Question Answering System, a paper about building user interface for question answering system. The live demo of his work is available online. On the second day Amrapali presented her TripleCheckMate: A Tool for Crowdsourcing the Quality Assessment of Linked Data, a paper about tool for checking the quality of Linked Data, in particular it was used in DBpedia Evaluation Campaign. And on the last day Ivan presented Linked Open Data Statistics: Collection and Exploitation, a paper about LODStats, tool for collecting statistics over the LOD Cloud.

At the end of the conference on the award ceremony a delightful event happened. Our Linked Open Data Statistics: Collection and Exploitation received the best system description award (see pic below).

Posted in Announcements | Leave a comment

AKSW Colloquium on Monday, October 21

In our weekly AKSW Colloquium, we present research, technologies and tools of the Semantic Web. The colloquium is open to the public and we welcome interested students, colleagues and industry partners to experience bleeding edge work-in-progress presentations and discussion rounds as well as talks by invited experts of our AKSW lecture series.

On Monday, October 21at 1.30 – 2.30 pm in Room P-702 (Paulinum), we will have presentations by Suresh Pokharel about Ontologies for farming in Nepal and by Didier Cherix about his master’s thesis about semi automatic error detection in ontologies (in german).

Furthermore, we would like to announce, that there is complimentary coffee and cake after the session. Bachelor and Master students will be able to get points for attendance.

“Ontology Based Data Access and Integration for Improving the Effectiveness of Farming in Nepal” by Suresh Pokharel, new PhD student

http://wiki.aksw.org/Colloquium/files?get=suresh_200.jpgI am Suresh Pokharel and I am studying at the University of Leipzig. My background is a Master of Engineering in Information and Communications Technologies (2008–2010) from Asian Institute of Technology, Thailand. I did my master’s thesis on the topic “Web Forum Mining based on User Satisfaction” under the supervision of Professor Sumanta Guha. I have an Bachelor of Engineering in Computer (2001–2005) from Pokhara University, Nepal. I taught (Part Time) Data Mining, Artificial Intelligence, Project works, Database etc. in Nepal College of Information Technology, Nepal since Sept 2010 to 29 Sept 2013.

In AKSW, I am working on the topic “Ontology Based Data Access and Integration for Improving the Effectiveness of Farming in Nepal”. The objective of this research is to integrate the agriculture related data (weather, crop, soil, geo-spatial data) with the help of semantic web technology for getting the richer agriculture related information.

Ontologiemetriken zur Datenqualitätsverbesserung von Didier Cherix, Masterarbeit

Ich studiere an der Universität Leipzig, wo ich meine Bachelorarbeit über die Generierung von SPARQL queries geschrieben habe.

Nun stelle ich meine Masterarbeit vor. Diese behandelt die Entwicklung eines semi-automatisierten Verfahrens zur Entdeckung von potentiellen Fehlern in einer Ontologie.
Um potentiell fehlerhafte Instanzen zu finden, werden die Werte der verschiedenen Properties analysiert und in Metriken erfasst. Mittels dieser Metriken werden die einzelnen Instanzen einer Klasse geclustert und somit versucht, Fehler zu entdecken.

Posted in Announcements, Colloquium, Events | Leave a comment

BIG at LSWT 2013 – From BIG Data to Smart Data

The 5th Leipziger Semantic Web Tag (LSWT2013) was organized as a meeting point for german as well as international Linked Data experts. Under the motto: From Big Data to Smart Data sophisticated methods that enable handling large amounts of data have been presented on September 23th in Leipzig. The keynote was held by Hans Uszkoreit, scientific director at the German Research Center for Artificial Intelligence (DFKI). By being introduced  to Text Analytics and Big Data issues the participants of the LSWT 2013 discussed the intelligent usage of huge amounts of data in the web.

Presentations on industrial and scientific solutions showed working solutions to big data concerns. Companies like Empolis, Brox and Ontos presented Linked Data and Semantic Web solutions capable of handling terabytes of data. However, also traditional approaches, like Datameer’s Data Analytics Solution based on Hadoop pointed out that big data could be handled nowadays without bigger problems.

Furthermore, problems detecting topics in massive data streams (Topic/S), document collections (WisARD) or corpora at information service providers (Wolters Kluwer) were tackled. Even the ethical issue of robots replacing journalists by the help of semantic data has been examined by Alexander Siebert from Retresco.

In conclusion, the analysis of textual information in large amounts of data is an interesting and so far not yet fully solved area of work. Further Information are available from the website.

Further information on topics related to data analysis, data curation, data storage, data acquisition and data usage can be found in our technical whitepaper available from our project website.

Posted in Data Quality, Events | Leave a comment

SlideWiki is now Open Source

We are pleased to announce that we have just released the SlideWiki.org source code under the permissive Apache open-source license. It is now available for download from the AKSW Github repository at:
https://github.com/AKSW/SlideWiki

The SlideWiki database dumps are also available at:
http://slidewiki.org/db/

SlideWiki.org is a platform for OpenCourseWare authoring and enables communities of educators to author, share and re-use multilingual educational content in a truly collaborative way. By completely open-sourcing SlideWiki and giving the community access to all the content we aim at:

  • Providing open access to crowdsourced e-learning material to be authored, shared and reused.
  • Collaborating with other open-source projects to improve the quality of SlideWiki implementation.
  • Inviting developers to openly contribute to SlideWiki and to write customized plugins and themes for SlideWiki.
  • Providing offline access to SlideWiki system.

To read more about SlideWiki features, see:
http://slidewiki.org/documentation

On behalf of SlideWiki team,
Ali Khalili, Darya Tarasowa and Sören Auer

Posted in Announcements, SlideWiki, Software Releases | Leave a comment

OntoWiki Feature of the Week: LOV Integration

This weeks presented feature is the new integration interface of the Linked Open Vocabularies (LOV) repository. LOV is a hand crafted repository of linked RDF/OWL vocabularies with well managed vocabulary meta data. We added a Vocabulary Selection module in order to allow searching and exploring of the LOV repository. This module allows for:

  • Create a Knowledge Base directly from a search result by importing the content from the web (create model screen)
  • Add the content of a specific vocabulary to an existing Knowledge Base (add data screen)
  • Add namespace prefixes based on a LOV search (configure model screen)

Search is done by a SPARQL query which uses a case-insensitive regular expression over all direct properties of the listed vocabularies.

The screenshot shows the module in action, in this case a result of a search for “Event”.

This LOV integration is currently available in the develop branch only and will be released with the next OntoWiki version.

Posted in Announcements, OntoWiki | Leave a comment

OntoWiki Feature of the Week: Document Management

This weeks presented feature is the files extension available at https://github.com/AKSW/files.ontowiki

An not so often requested feature of OntoWiki is the ability to not just talk about resources such as PDF documents and MP3 files, but also to manage these files inside OntoWiki. The files extension is exactly this. By enabling this extension, you can

  • Upload and download any file to your wiki
  • Annotate and explore uploaded files as any OntoWiki resource
  • Attach a file to an existing resource

This allows for use cases such as music database or bibliography management where your data is about concrete documents. In addition to that, this extension is useful if you already use the site extension and just want to upload some multimedia for your site.

Posted in Announcements, OntoWiki | Leave a comment

OntoWiki Feature of the Week: API Documentation

This week we would like to present the revised API reference documentation, hostet at http://api.ontowiki.net/.

The documentation collects informationen about classes of OntoWiki and the Erfurt library while leaving out other libraries (like Zend Framework etc.) to stay compact.

Next to the documentation for the master branch, you now are also able to look inside the current state at the develop branch.
While the master branch documentation keeps it simple, the develop branch documentation offers more information (hints, todos, etc.) for developers.

A special feature of the OntoWiki API documentation is the presentation of the triggered events including documentation and location.


For working offline you now can download the complete API reference documentation of each branch directly from http://api.ontowiki.net/ (see tab Downloads).

Posted in Announcements, OntoWiki | Leave a comment

AKSW wins I-Semantics' I-CHALLENGE 2013

We are very happy to announce that AKSW researchers won this year’s I-CHALLENGE with the paper “Linked Cancer Genome Atlas Database”, which was written by Muhammad Saleem, Shanmukha Sampath Padmanabhuni, Axel-Cyrille Ngonga Ngomo, Jonas S. Almeida, Stefan Decker and Helena F. Deus. In this paper, we present the triplification and linking of The Cancer Genome Atlas (TCGA) as well as applications of this triplification to targeted cancer treatment. The paper will be presented at this year’s I-Semantics and is available at http://goo.gl/1jIVK4. Come around and listen to the talk.
Regards from Leipzig,
Saleem and Axel on behalf of AKSW
Posted in Announcements | Leave a comment