Evaluation Methods for Rankings of Facetvalues for Faceted Search

Posted on 19-07-2011 by Anne | research | No comments

A paper on Evaluation Methods for Rankings of Facetvalues for Faceted Search was accepted at the Conference on Multilingual and Multimodal Information Access Evaluation 2011. Below is the abstract:

We introduce two metrics aimed at evaluating systems that select facetvalues for a faceted search interface. Facetvalues are the values of meta-data fields in semi-structured data and are commonly used to refine queries. It is often the case that there are more facetvalues than can be displayed to a user and thus a selection has to be made. Our metrics evaluate these selections based on binary relevance assessments for the documents in a collection. Both our metrics are based on Normalized Discounted Cumulated Gain, an often used Information Retrieval metric.
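
As a rough illustration of the metric family the abstract refers to, the sketch below computes one common formulation of NDCG (with a log2 rank discount) over a ranked selection of facet values. It is not an implementation of the paper's two metrics: the gain of a facet value is assumed here to be the number of relevant documents carrying that value, and the names `dcg`, `ndcg_at_k` and the sample counts are hypothetical.

```python
import math


def dcg(gains):
    """Discounted cumulated gain: each gain is divided by log2(rank + 1)."""
    return sum(g / math.log2(rank + 1) for rank, g in enumerate(gains, start=1))


def ndcg_at_k(ranked_gains, candidate_gains, k):
    """DCG of the first k shown items, normalised by the best possible DCG
    obtainable from any k of the candidate items."""
    ideal = sorted(candidate_gains, reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    if ideal_dcg == 0:
        return 0.0
    return dcg(ranked_gains[:k]) / ideal_dcg


# Hypothetical data: number of relevant documents per facet value
# (assumption: this count is used as the gain of a facet value).
relevant_per_value = {"VVD": 3, "PvdA": 2, "CDA": 0, "D66": 1}

# A system shows three of the four facet values, in this order.
selection = ["PvdA", "VVD", "CDA"]

score = ndcg_at_k(
    [relevant_per_value[v] for v in selection],
    list(relevant_per_value.values()),
    k=3,
)
```

Normalising against all candidate values, rather than only the shown ones, penalises a selection that leaves out a useful facet value as well as one that orders its values poorly.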

A PDF version of the paper can be found here. A longer version with experiments is also available.

@inproceedings{schuth_evaluation_2011,
title = {Evaluation Methods for Rankings of Facetvalues for Faceted Search},
booktitle = {Proceedings of the Conference on Multilingual and Multimodal Information Access Evaluation 2011},
year = {2011},
publisher = {Springer},
author = {Schuth, A. and Marx, M.J.}
}

Collaboration between PoliticalMashup and NRC Den Haag

Posted on 05-07-2011 by Maarten Marx | Uncategorized | No comments

PoliticalMashup has started a collaboration with the Hague bureau of NRC Handelsblad.
The first article, on the activities of the old and new members of parliament during the first year of the Rutte cabinet, appeared on Saturday 2 July. All facts can be found on a dedicated website: http://nrc.nl/denhaag/.

Politiekinzicht.com wins 3rd prize in the Open Data Challenge

Posted on 29-06-2011 by Maarten Marx | Political Mashup, parliament | No comments

Three first-year Information Science (Informatiekunde) students at the UvA have won 3rd prize in the visualisation track of the Open Data Challenge with their site politiekinzicht.com. They built an application that very quickly shows what each politician in the Tweede Kamer (the Dutch House of Representatives) speaks about.

The prize was presented by Neelie Smit Kroes. The jury included, among others, Sir Tim Berners-Lee (W3C), Tom Lee (Sunlight Foundation) and Rufus Pollock (Open Knowledge Foundation).

The application is based on the Handelingen der Staten Generaal (the proceedings of the Dutch parliament), which have been made available as Open Data in the PoliticalMashup project of the Informatics Institute of the UvA. Further information.

About the prize

The Open Data Challenge was Europe’s biggest open data competition to date. There were 20,000 euros in prizes to win, and a total of 430 entries from 24 EU Member States. It was open for 60 days - from early April to early June 2011.

The winners were selected by an all-star cast of open data gurus, and announced by Vice President of the European Commission Neelie Kroes at the Digital Agenda Assembly in Brussels.

From the jury report

We need better ways to understand our politicians, ways that go beyond catching a single quote to illuminate all of their commitments, interests and actions. That is why I really like this app.

David Eaves, Advisor to the Mayor of Vancouver


My favorite data visualisation was the Dutch entry called “Politiek Inzicht”, which shows what members of parliament talk about, by visualising tag clouds for all individual speeches, reports and so on given by members of parliament. This is not only done in a way which is very fun - it also provides valuable insight into the real political focus of each politician, and allows for comparison between individuals within parties or across parties. When I explored this app, I immediately thought - “That's what we need in Germany too!”.


Anke Domscheit-Berg, Government 2.0 Netzwerk Deutschland



University of Amsterdam XML Web Collection

Posted on 07-06-2011 by Maarten Marx | XML, data | No comments

Steven Grijzenhout has made a collection of XML files crawled from the web available for research purposes.
The collection is available at http://data.politicalmashup.nl/sgrijzen/xmlweb/. A description and an analysis of the data can be found in the paper The Quality of the XML Web.

Politicologen Etmaal

Posted on 06-06-2011 by Maarten Marx | lecture | No comments

PoliticalMashup contributes two talks to this year's Politicologenetmaal: one on the alleged left-wing bias of Dutch TV and one on voting advice applications on the web.
Here are the accompanying slides:

Bart de Goede's talk won the best presentation award in its session. A fine achievement for a Bachelor student.

Voting Advice via Direct Access to the Relevant Data (Maarten Marx)

Slant on Dutch TV. Is TV language use really dominated by left? (Bart de Goede and Maarten Marx)

Informatics Institute colloquium

Posted on 12-05-2011 by Maarten Marx | lecture, parliament, research | No comments

On May 24 at 16.00, Maarten Marx will give a talk at the Informatics Institute colloquium.

Location: Science Park 904, Room D1.113, Amsterdam
Title: Parliamentary Information Systems
Abstract:
The proceedings of national parliaments are fascinating material for information scientists.
For the Netherlands, they consist of 197 years of digitally available data. Apart from some modern gaps (see http://politicalmashup.nl/2011/03/uva-informatica-onderzoek-leidt-tot-kamervragen/), this dataset is complete. We have similarly complete datasets for the UK, Spain and the Flemish parliament (though for shorter periods).

Anachronistically we can describe the data as a multimedia, hyperlinked database consisting mostly of rich semi-structured text documents.
Within the PoliticalMashup project, UvA turns this anachronism into reality. This opens a wealth of new research possibilities situated in the emerging field of computational humanities.

In the talk we will show both the techniques used for the transformation and the resulting applications within the computational humanities.

Justin van Wees looked at the existing communities within the Informatics Institute by analyzing k-clique communities within the IvI co-author graph. In the attached diagrams we show the largest 3-clique community (66 nodes) and the two largest 4-clique communities contained in it (24 and 18 nodes). Next we show the second and third largest 3-clique communities (7 and 4 nodes). There is one more 4-clique community, but it is contained in the large component, so we did not show it separately:
ivi_network_3_en_4.pdf and ivi_network_3_en_4.svg (the small 4-node component drifted out of the picture here)
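
For readers unfamiliar with the term: in the clique percolation sense, a k-clique community is the union of all k-cliques that can be reached from one another through k-cliques sharing k-1 nodes. The minimal sketch below assumes that definition and uses a brute-force clique search, which is only practical for small graphs. It is not the code used to produce the diagrams, and the function name and the toy edge list are hypothetical.

```python
from itertools import combinations


def k_clique_communities(edges, k):
    """Return k-clique communities as sets of nodes, largest first."""
    # Build an undirected adjacency map.
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # Brute force: every k-subset of nodes whose members are pairwise linked.
    cliques = [
        frozenset(c)
        for c in combinations(sorted(adj), k)
        if all(b in adj[a] for a, b in combinations(c, 2))
    ]

    # Union-find over cliques: merge two cliques when they share k-1 nodes.
    parent = list(range(len(cliques)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(cliques)), 2):
        if len(cliques[i] & cliques[j]) == k - 1:
            parent[find(i)] = find(j)

    # A community is the union of the nodes of all cliques in one group.
    communities = {}
    for i, clique in enumerate(cliques):
        communities.setdefault(find(i), set()).update(clique)
    return sorted(communities.values(), key=len, reverse=True)


# Toy co-author graph (hypothetical): two triangles sharing an edge,
# plus a separate triangle attached by a single edge.
toy_edges = [
    ("a", "b"), ("b", "c"), ("a", "c"),
    ("b", "d"), ("c", "d"),
    ("d", "x"),
    ("x", "y"), ("y", "z"), ("x", "z"),
]
three_clique_communities = k_clique_communities(toy_edges, 3)
four_clique_communities = k_clique_communities(toy_edges, 4)
```

In the toy graph the single edge between `d` and `x` does not merge the two groups for k = 3, because it is not part of any triangle. For a graph the size of a real co-author network, a library implementation based on maximal cliques would be the more sensible choice.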

Talk Mattia Tomasoni May 10

Posted on 02-05-2011 by Maarten Marx | Uncategorized | No comments

Mattia Tomasoni visits ILPS on May 10 and 11. He might come to work with us as a PhD student. Mattia will give a talk on the topic of his 2010 ACL paper in an improvised ILPS seminar.
If you want to meet up with him, mail him at tomasonimattia@googlemail.com. He will be with us for the full two days.

Time and place: May 10, 10.30-11.15 Room A1.08.

Title: “Summarization in Yahoo! Answers”

Abstract:
“The objective of my MSc thesis was to automatically summarize information crawled from the Yahoo! Answers website with the purpose of generating trustful, complete, relevant and succinct summaries in response to users’ questions.

Unfortunately, information found online is often redundant, noisy and untrustworthy; interestingly, though, content generated by actual individuals (rather than published by an editor) contains metadata that can be exploited (i.e. machine learned) to overcome those very same difficulties!

To this end, my former supervisor and I devised four “metadata-aware measures for answer summarization”: Quality, Coverage, Relevance and Novelty. How they are defined, calculated, combined and finally evaluated will be the topic of my talk.”

Link to the paper (ACL 2010): http://portal.acm.org/citation.cfm?id=1858759

Protected: Who speaks in the Tweede Kamer? Per party and per parliamentary term.

Posted on 19-04-2011 by Maarten Marx | parliament, results



Protected: Elsevier and the Tweede Kamer

Posted on 08-04-2011 by Maarten Marx | Political Mashup, research



PhD position in Logic/XML/Trees

Posted on 07-04-2011 by Maarten Marx | XML, research | 1 comment

The University of Amsterdam has a fully funded 4-year PhD position available. The research topic is the interplay of logic, finite model theory, and the theory of (XML) trees, and is motivated by a concrete problem in database research:

Data Exchange for Document Centric XML.
