Informatics Institute colloquium

Geplaatst op 12-05-2011 door Maarten Marx | lecture, parliament, research | | comment image Geen reacties »

May 24 at 16.00, Maarten Marx will give a talk at the Informatics Institute colloquium.

Location: Science Park 904, Room D1.113, Amsterdam
Title: Parliamentary Information Systems
The proceedings of national parliaments are fascinating material for information scientists.
For the Netherlands, they consist of 197 years of digitally available data. Apart from some modern gaps (see, this datset is complete. We have similar complete datasets for the UK, Spain and the Flemish parliament (though for shorter periods).

Anachronistically we can describe the data as a multimedia, hyperlinked database consisting mostly of rich semi-structured text documents.
Within the PoliticalMashup project, UvA turns this anachronism into reality. This opens a wealth of new research possibilities situated in the emerging field of computational humanities.

In the talk we will both show the techniques used for the transformation and applications within the computational humanities.

Justin van Wees looked at the existing communities within the Informatics Institute by analyzing k-clique communities within the IvI co-author graph. In the attached diagrams we show the largest 3-clique community (66 nodes ) and the two largest 4-clique communities contained in it (24 en 18 nodes). Next we show the second and third largest 3-clique communities (7 en 4 nodes). There is one more 4-clique community but that was contained in the large component, so we did not show it:
ivi_network_3_en_4.pdf and ivi_network_3_en_4.svg (the small 4 node component drifted out of the picture here)

Talk Mattia Tomasoni May 10

Geplaatst op 02-05-2011 door Maarten Marx | Uncategorized | | comment image Geen reacties »

Mattia Tomasoni visits ILPS on May 10 and 11. He might come to work with us as a PhD student. Mattia gives a talk on the topic of his 2010 ACL paper in an improvised ILPS seminar.
If you want to meet up with him, mail him at He is with us the full two days.

Time and place: May 10, 10.30-11.15 Room A1.08.

Title: “Summarization in Yahoo! Answers”

“The objective of my MSc thesis was to automatically summarize information crawled from the Yahoo! Answers website with the purpose of generating trustful, complete, relevant and succinct summaries in response to users’ questions.

Unfortunately, information found online is often redundant, noisy and untrustworthy; interestingly, though, content generated by actual individuals (rather than published by an editor) contains metadata that can be exploited (i.e. machine learned) to overcome those very same difficulties!

To this end, my former supervisor and I devised four “metadata-aware measures for answer summarization”: Quality, Coverage, Relevance and Novelty. How they are defined, calculated, combined and finally evaluated will be the topic of my talk.”

Link to paper (paper at ACL 2010)