Prof. Dr. Martin Theobald

Prof. Dr. Martin Theobald folgte einem Ruf an die Universtität Luxembourg und verließ daher die Universtität Ulm im Februar 2017.

Prof. Dr. Martin Theobald wurde im Februar 2015 von der Universität Ulm als ordentlicher Professor mit dem Schwerpunkt "Data Science" an das  Institut für Datenbanken und Informationssysteme berufen. Martin Theobald promovierte an der  Universität des Saarlandes im Jahr 2006 und verbrachte anschließend zwei Jahre als Postdoktorand am  Stanford Infolab (USA). Anschließend arbeitete er als Senior Researcher am  Max-Planck-Institut für Informatik in Saarbrücken, bevor er 2012 als assoziierter Professor an die Universität Antwerpen (Belgien) berufen wurde.

In seiner aktuellen Forschung beschäftigt sich Prof. Dr. Theobald  mit verteilten Datenbankarchitekturen und der skalierbaren Analyse großer Datenmengen. Ein thematischer Fokus liegt dabei auf dem Gebiet der Informationsextraktion sowie der Repräsentation und skalierbaren Auswertung der aus der Extraktion gewonnenen Daten mittels verteilter Graphdatenbanken. Weitere Forschungsthemen befassen sich mit der Konzeption und Entwicklung von temporalen und probabilistischen Datenbanksystemen. Als Dissertationsthema entwickelte Martin Theobald eine Suchmaschine für die ranglistenbasierte Auswertung  von Anfragen auf großen XML-Datensätzen („TopX“), wofür er unter anderem den Dissertationspreis des Fachbereichs für „Datenbanken und Informationssysteme“ der Gesellschaft für Informatik (GI) sowie eine ehrenhafte Erwähnung im Rahmen des Jim Gray Dissertation Awards der ACM erhielt. Die Forschungsgruppe am Max-Planck-Institut für Informatik erhielt 2010 einen fokussierten Forschungspreis von Google für ihre Arbeit im Bereich der robusten und skalierbaren Extraktion von semantischen Wissensbasen aus Textquellen.

Prof. Theobald agiert aktuell als Fachbereichseditor für Elseviers „Information Systems“ und als Gutachter für zahlreiche, international renommierte Fachzeitschriften, Konferenzen und Workshops.

Öffentliche Profile:

Aktuelle Kurse

Vorlesungen

Seminare und Praktika (jedes Semester)

Doktoranden und Gäste

Forschungsbesuch: Dr. Jinchuan Chen

Abgeschlossene Promotionen

Publikationen

| 2016 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 |

2016

Nguyen, Dat Ba and Theobald, Martin and Weikum, Gerhard (2016) J-NERD: Joint Named Entity Recognition and Disambiguation with Rich Linguistic Features. TACL, ACL, Vol. 4, pp. 215-229.
Wang, Yafang and Ren, Zhaochun and Theobald, Martin and Dyla , Maximilian and de Melo, Gerard (2016) Summary Generation for Temporal Extractions. In: Database and Expert Systems Applications - 27th International Conference (DEXA 2016), Porto, Portugal, 5-9 September 2016, Lecture Notes in Computer Science 9827, Springer, pp. 370-386.

2014

Dylla, Maximilian and Theobald, Martin and Miliaraki, Iris (2014) Querying and Learning in Probabilistic Databases. In: Reasoning Web. Reasoning on the Web in the Big Data Era - 10th International Summer School 2014, Athens, Greece, September 8-13, 2014. Proceedings, Lecture Notes in Computer Science, Springer, Vol. 8714, pp. 313-368.
Gurajada, Sairam and Seufert, Stephan and Miliaraki, Iris and Theobald, Martin (2014) TriAD: a distributed shared-nothing RDF engine based on asynchronous message passing. In: International Conference on Management of Data (SIGMOD 2014), Snowbird, US, 22 - 27 June 2014, ACM, pp. 289-300.
Gurajada, Sairam and Seufert, Stephan and Miliaraki, Iris and Theobald, Martin (2014) Using Graph Summarization for Join-Ahead Pruning in a Distributed RDF Engine. In: Semantic Web Information Management on Semantic Web Information Management, ACM, pp. 41:1-41:4.
Melo, Andre and Theobald, Martin and Völker, Johanna (2014) Correlation-Based Refinement of Rules with Numerical Attributes. In: 27th International Florida Artificial Intelligence Research Society Conference (FLAIRS 2014) , Pensacola Beach, Florida, 21-23 May 2014, AAAI Press.
Nguyen, Dat Ba and Hoffart, Johannes and Theobald, Martin and Weikum, Gerhard (2014) AIDA-light: High-Throughput Named-Entity Disambiguation. In: Proceedings of the Workshop on Linked Data on the Web co-located with the 23rd International World Wide Web Conference (WWW 2014), Seoul, Korea, April 8, 2014., CEUR Workshop Proceedings, CEUR-WS.org, Vol. 1184.

2013

Bellot, Patrice and Doucet, Antoine and Geva, Shlomo and Gurajada, Sairam and Kamps, Jaap and Kazai, Gabriella and Koolen, Marijn and Mishra, Arunav and Moriceau, Véronique and Mothe, Josiane and Preminger, Michael and SanJuan, Eric and Schenkel, Ralf and Tannier, Xavier and Theobald, Martin and Trappett, Matthew and Wang, Qiuyue (2013) Overview of INEX 2013. In: CLEF Lab Reports.
Dylla, Maximilian and Miliaraki, Iris and Theobald, Martin (2013) A Temporal-Probabilistic Database Model for Information Extraction. PVLDB, 6(14): 1810-1821.
Dylla, Maximilian and Miliaraki, Iris and Theobald, Martin (2013) Top-k Query Processing in Probabilistic Databases with Non-Materialized Views. In: IEEE 29th International Conference on Data Engineering (ICDE 2013), Brisbane, Australia, IEEE Computer Society.
Gurajada, Sairam and Kamps, Jaap and Mishra, Arunav and Schenkel, Ralf and Theobald, Martin and Wang, Qiuyue (2013) Overview of the INEX 2013 Linked Data Track. In: CLEF (Online Working Notes/Labs/Workshop).
Mishra, Arunav and Gurajada, Sairam and Theobald, Martin (2013) SPAR-Key: Processing SPARQL-Fulltext Queries to Solve Jeopardy! Clues. In: CLEF (Online Working Notes /Labs / Workshop).
Theobald, Martin and DeRaedt, Luc and Kimmig, Angelika and Dylla, Maximilian and Miliaraki, Iris (2013) 10 Years of Probabilistic Querying — What Next?. In: 16th East-European Conference on Advances in Databases and Information Systems, LNCS, Springer.

2012

Bellot, Patrice and Chappell, Timothy and Doucet, Antoine and Geva, Shlomo and Kamps, Jaap and Kazai, Gabriella and Koolen, Marijn and Landoni, Monica and Marx, Maarten and Moriceau, Véronique and Mothe, Josiane and Ramirez Camps, Georgiana and Sanderson, Mark and SanJuan, Eric and Scholer, Falk and Tannier, Xavier and Theobald, Martin and Trappett, Matthew and Trotman, Andrew and Wang, Qiuyue (2012) Report on INEX 2011. ACM SIGIR Forum, 46(1): 33-42, ACM.
Dylla, Maximilian and Miliaraki, Iris and Theobald, Martin (2012) Top-k Query Processing in Probabilistic Databases with Non-Materialized Views. pp. 62, Technical Report MPI-I-2012-5-002, Max Planck Insititute for Informatics.
Hoffart, Johannes and Seufert, Stephan and Ba Nguyen, Dat and Theobald, Martin and Weikum, Gerhard (2012) KORE: keyphrase overlap relatedness for entity disambiguation. In: 21st ACM International Conference on Information and Knowledge Management (CIKM 2012), pp. 545-554.
Kim, Kwang In and Tompkin, James and Theobald, Martin and Kautz, Jan and Theobalt, Christian (2012) Match Graph Construction for Large Image Databases. In: 12th European Conference on Computer Vision (ECCV 2012), Lecture Notes in Computer Science, Springer, Vol. 7572, pp. 272-285.
Mishra, Arunav and Gurajada, Sairam and Theobald, Martin (2012) Design and evaluation of an IR-benchmark for SPARQL queries with fulltext conditions. In: Fifth ACM Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR'12), ACM, pp. 9-10.
Mishra, Arunav and Gurajada, Sairam and Theobald, Martin (2012) Running SPARQL-fulltext queries inside a relational DBMS. In: CLEF 2012 Evaluation Labs and Workshop, Online Working Notes, pp. 1-13.
Nakashole, Ndapandula and Sozio, Mauro and Suchanek, Fabian and Theobald, Martin (2012) Query-time reasoning in uncertain RDF knowledge bases with soft and hard rules. In: Second International Workshop on Searching and Integrating New Web Data Sources, CEUR Workshop Proceedings, CEUR, Vol. 884, pp. 15-20.
Raschia, Guillaume and Theobald, Martin and Manolescu, Ioana (2012) Proceedings of the First International Workshop on Open Data (WOD-2012). CoRR, Vol. abs/12.
Wang, Qiuyue and Ramírez , Georgina and Marx, Maarten Marx and Theobald, Martin and Kamps, Jaap (2012) Overview of the INEX 2011 Data-Centric Track. In: 10th International Workshop of the Initiative for the Evaluation of XML Retrieval, Lecture Notes in Computer Science, Springer, Vol. 7424, pp. 118-137.
Wang, Qiuyue and Kamps, Jaap and Ramirez Camps, Georgina and Marx, Maarten and Schuth, Anne and Theobald, Martin and Gurajada, Sairam and Mishra, Arunav (2012) Overview of the INEX 2012 Linked Data Track. In: CLEF 2012 Evaluation Labs and Workshop, Online Working Notes, pp. 1-13.

2011

Adolphs, Peter and Theobald, Martin and Schäfer , Ulrich and Uszkoreit, Hans and Weikum, Gerhard (2011) YAGO-QA: Answering Questions by Structured Knowledge Queries. In: Fifth IEEE International Conference on Semantic Computing, IEEE, pp. 158-161.
Dylla, Maximilian and Sozio, Mauro and Theobald, Martin (2011) Resolving Temporal Conflicts in Inconsistent RDF Knowledge Bases. In: 14. Fachtagung Datenbanksysteme in Business, Technologie und Web (BTW 2011) ), Lecture Notes in Informatics, Bonner Köllen Verlag, Vol. 180, pp. 474-493.
Feld, Michael and Theobald, Martin and Stahl, Christoph and Meiser, Timm and Müller , Christian (2011) Generating Personalized Destination Suggestions for Automotive Navigation Systems under Uncertainty. In: 19th International Conference on User Modeling, Adaptation and Personalization, pp. 22-24.
Hose, Katja and Schenkel, Ralf and Theobald, Martin and Weikum, Gerhard (2011) Database foundations for scalable RDF processing. In: 7th International Summer School Semantic Technologies for the Web of Data, 2011, Lecture Notes in Computer Science, Springer, Vol. 6848, pp. 202-249.
Meiser, Timm and Dylla, Maximilian and Theobald, Martin (2011) Interactive Reasoning in Uncertain RDF Knowledge Bases. In: ACM International Conference on Information and Knowledge Management (CIKM 2011), pp. 2557-2560.
Nakashole, Ndapandula and Theobald, Martin and Weikum, Gerhard (2011) Scalable Knowledge Harvesting with High Precision and High Recall. In: 4th ACM International Conference on Web Search and Data Mining, ACM, pp. 227-236.
Yahya, Mohamed and Theobald, Martin (2011) D2R2: Disk-oriented Deductive Reasoning in a RISC-style RDF Engine. In: 5th International Symposium on Rule-Based Modeling and Computing on the Semantic Web (RuleML 2011), Lecture Notes in Computer Science, Springer, Vol. 7018, pp. 81-96.

2010

Alonso, Omar and Schenkel, Ralf and Theobald, Martin (2010) Crowdsourcing Assessments for XML Ranked Retrieval. In: 32nd European Conference on Advances in Information Retrieval Research (ECIR 2010), Lecture Notes in Computer Science, Springer, Vol. 5993, pp. 602-606.
Beckers, Thomas and Bellot, Patrice and Demartini, Gianluca and Denoyer, Ludovic and de Vries, Christopher M. and Doucet, Antoine and Fachry, Khairun Nisa and Fuhr, Norbert and Gallinari, Patrick and Geva, Shlomo and Huang, Wei-Che and Iofciu, Tereza and Kamps, Jaap and Kazai, Gabriella and Koolen, Marijn and Kutty, Sangeetha and Landoni, Monica and Lehtonen, Miro and Moriceau, Véronique and Nayak, Richi and Nordlie, Ragnar and Pharo, Nils and SanJuan, Eric and Schenkel, Ralf and Tannier, Xavier and Theobald, Martin and Thom, James A. and Trotman, Andrew and de Vries, Arjen P. (2010) Report on INEX 2009. SIGIR Forum, 44(1): 38-57, ACM.
Das Sarma, Anish and Theobald, Martin and Widom, Jennifer (2010) LIVE: A Lineage-Supported Versioned DBMS. pp. 13, Technical Report ILPUBS-926, Stanford University.
Das Sarma, Anish and Theobald, Martin and Widom, Jennifer (2010) LIVE: A Lineage-Supported Versioned DBMS. In: 22nd International Conference Scientific and Statistical Database Management, Lecture Notes in Computer Science, Springer, Vol. 6187, pp. 416-433.
Lauw, Hady W. and Schenkel, Ralf and Suchanek, Fabian and Theobald, Martin and Weikum, Gerhard (2010) Harvesting Knowledge from Web Data and Text. In: 19th International Conference on Information and Knowledge Management (CIKM 2010), pp. 1-6.
Nakashole, Ndapandula and Theobald, Martin and Weikum, Gerhard (2010) Find your Advisor: Robust Knowledge Gathering from the Web. In: 13th International Workshop on the Web and Databases, pp. 1-6.
Sonntag, Daniel and Theobald, Martin (2010) Explanations in Dialogue Systems through Uncertain RDF Knowledge Bases. In: 5th International Workshop on Explanation-Aware Computing, CEUR Workshop Proceedings, CEUR-WS.org, Vol. 650, pp. 1-12.
Theobald, Martin and Sozio, Mauro and Suchanek, Fabian and Nakashole, Ndapandula (2010) URDF: Efficient Reasoning in Uncertain RDF Knowledge Bases with Soft and Hard Rules. pp. 48, Technical Report MPI-I-2010-5-002, Max-Planck-Institut für Informatik.
Theobald, Martin and Aji, Ablimit and Schenkel, Ralf (2010) TopX 2.0 at the INEX 2009 Ad-Hoc and Efficiency Tracks. In: 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, Lecture Notes in Computer Science, Springer, Vol. 6203, pp. 218-228.
Wang, Yafang and Yahya, Mohamed and Theobald, Martin (2010) Time-aware Reasoning in Uncertain Knowledge Bases. In: 4th International VLDB Workshop on Management of Uncertain Data, Vol. WP 10-, pp. 51-65.
Weikum, Gerhard and Theobald, Martin (2010) From Information to Knowledge: Harvesting Entities and Relationships from Web Sources. In: 29th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS 2010), pp. 65-76.

2009

Demartini, Gianluca and Denoyer, Ludovic and Doucet, Antoine and Fachry, Khairun Nisa and Gallinari, Patrick and Geva, Shlomo and Huang, Wei-Che and Iofciu, Tereza and Kamps, Jaap and Kazai, Gabriella and Koolen, Marijn and Landoni, Monica and Nordlie, Ragnar and Pharo, Nils and Schenkel, Ralf and Theobald, Martin and Trotman, Andrew and de Vries, Arjen P. and Woodley, Alan and Zhu, Jianhan (2009) Report on INEX 2008. SIGIR Forum, 43(1): 17-36, ACM.
Marian, Amélie and Schenkel, Ralf and Theobald, Martin (2009) Ranked XML Processing. Springer, In: Encyclopedia of Database Systems. pp. 2325-2332.
Schenkel, Ralf and Theobald, Martin (2009) Integrated DB&IR Semi-Structured Text Retrieval. Springer, In: Encyclopedia of Database Systems. pp. 1543-1546.
Schenkel, Ralf and Theobald, Martin (2009) Overview of the INEX 2009 Efficiency Track. In: 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, pp. 200-212.
Theobald, Martin and Shah, Nigam and Shrager, Jeff (2009) Extraction of Conditional Probabilities of the Relationships Between Drugs, Diseases, and Genes from PubMed Guided by Relationships in PharmGKB. In: AMIA Summit on Translational Bioinformatics, pp. 124-128.
Whang, Steven Euijong and Menestrina, David and Koutrika, Georgia and Theobald, Martin and Garcia-Molina, Hector (2009) Entity Resolution with Iterative Blocking. In: International Conference on Management of Data & 28th Symposium on Principles of Database Systems (SIGMOD-PODS'09), pp. 219-232.

2008

Benjelloun, Omar and Das Sarma, Anish and Halevy, Alon Y. and Theobald, Martin and Widom, Jennifer (2008) Databases with uncertainty and lineage. The VLDB Journal, 17(1): 243-264, Springer.
Broschart, Andreas and Schenkel, Ralf and Theobald, Martin (2008) Proximity-Aware Scoring for XML Retrieval. In: 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, pp. 46-49.
Das Sarma, Anish and Theobald, Martin and Widom, Jennifer (2008) Data Modifications and Versioning in Trio. pp. 14, Technical Report ILPUBS-849, Stanford University.
Das Sarma, Anish and Theobald, Martin and Widom, Jennifer (2008) Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases. In: 24th International Conference on Data Engineering (ICDE 2008), IEEE Computer Society Press, pp. 1023-1032.
Das Sarma, Anish and Deshpande, Amol and Hubauer, Thomas and Ilyas, Ihab F. and König-Ries , Birgitta and Renz, Matthias and Theobald, Martin (2008) Working Group Report: Lineage/Provenance. In: Uncertainty Management in Information Systems, Dagstuhl Seminar Proceedings 08421, pp. 08421:1-08421:5.
Kandel, Sean and Abelson, Eric and Garcia-Molina, Hector and Paepcke, Andreas and Theobald, Martin (2008) Photospread: a spreadsheet for managing photos. In: Conference on Human Factors in Computing Systems, pp. 1749-1758.
Theobald, Martin and Schenkel, Ralf (2008) Overview of the INEX 2008 Efficiency Track. In: 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, pp. 179-191.
Theobald, Martin and Siddharth, Jonathan and Paepcke, Andreas (2008) SpotSigs: robust and efficient near duplicate detection in large web collections.. In: 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 563-570.
Theobald, Martin and AbuJarour, Mohammed and Schenkel, Ralf (2008) TopX 2.0 at the INEX 2008 Efficiency Track. In: 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, pp. 224-236.
Theobald, Martin and Bast, Holger and Majumdar, Debapriyo and Schenkel, Ralf and Weikum, Gerhard (2008) TopX: Efficient and Versatile Top-k Query Processing for Semistructured Data. The VLDB Journal, 17(2): 81-115, Springer.

2007

Broschart, Andreas and Schenkel, Ralf and Theobald, Martin and Weikum, Gerhard (2007) TopX @ INEX 2007. In: 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, Springer, pp. 49-56.
Graupmann, Jens and Biwer, Michael and Zimmer, Christian and Zimmer, Patrick and Bender, Matthias and Theobald, Martin and Weikum, Gerhard (2007) COMPASS: A Concept-Based Web Search Engine for HTML, XML, and Deep Web Data. The Icfai University Press, In: Dynamics of Search Engines: An Introduction. pp. 193-203.
Kandel, Sean and Paepcke, Andreas and Theobald, Martin and Garcia-Molina, Hector (2007) The PhotoSpread Query Language. pp. 14, Technical Report DBPUBS-812, Stanford University.
Mutsuzaki, Michi and Theobald, Martin and de Keijzer, Ander and Widom, Jennifer and Agrawal, Parag and Benjelloun, Omar and Das Sarma, Anish and Murthy, Raghotham and Sugihara, Tomoe (2007) Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS. In: Third Biennial Conference on Innovative Data Systems Research, pp. 269-274.
Schenkel, Ralf and Broschart, Andreas and Hwang, Seungwon and Theobald, Martin and Weikum, Gerhard (2007) Efficient Text Proximity Search. In: 14th String Processing and Information Retrieval Symposium, Lecture Notes in Computer Science, Springer, Vol. 4726, pp. 287-299.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2007) The TopX DB&IR Engine (Demo). In: ACM SIGMOD International Conference on Management of Data (SIGMOD 2007), pp. 1141-1143.
Theobald, Martin and Broschart, Andreas and Schenkel, Ralf and Solomon, Silvana and Weikum, Gerhard (2007) TopX - Adhoc Track and Feedback Task. In: 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, Lecture Notes in Computer Science, Springer, Vol. 4518, pp. 233-242.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2007) TopX - Efficient and Versatile Top-k Query Processing for Text, Semistructured, and Structured Data. In: 12. GI-Fachtagung Datenbanksysteme in Business, Technologie und Web (BTW 2007), Vol. 103, pp. 475-485.

2006

Bast, Holger and Majumdar, Debapriyo and Schenkel, Ralf and Theobald, Martin and Weikum, Gerhard (2006) IO-Top-k at TREC 2006: Terabyte Track. In: 15th Text Retrieval Conference, NIST, pp. 551-555.
Bast, Holger and Majumdar, Debapriyo and Schenkel, Ralf and Theobald, Martin and Weikum, Gerhard (2006) IO-Top-k: Index-access Optimized Top-k Query Processing. pp. 43, Technical Report MPI-I-2006-5-002, Saarland University.
Bast, Holger and Majumdar, Debapriyo and Schenkel, Ralf and Theobald, Martin and Weikum, Gerhard (2006) IO-Top-k: Index-access Optimized Top-k Query Processing. pp. 43, Technical Report DELIS-TR-0323, University of Paderborn.
Bast, Holger and Majumdar, Debapriyo and Schenkel, Ralf and Theobald, Martin and Weikum, Gerhard (2006) IO-Top-k: Index-access Optimized Top-k Query Processing. In: 32nd International Conference on Very Large Data Bases (VLDB 2006), pp. 475-486.
Schenkel, Ralf and Theobald, Martin (2006) Feedback-Driven Structural Query Expansion for Ranked Retrieval of XML Data. In: 10th International Conference on Extending Database Technology (EDBT 2006), Lecture Notes in Computer Science, Springer, Vol. 3896, pp. 331-348.
Schenkel, Ralf and Theobald, Martin (2006) Relevance Feedback for Structural Query Expansion. In: 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, Lecture Notes in Computer Science, Springer, Vol. 3977, pp. 344-357.
Schenkel, Ralf and Theobald, Martin (2006) Structural Feedback for Keyword-Based XML Retrieval. In: 28th European Conference on IR Research, Lecture Notes in Computer Science, Springer, Vol. 3936, pp. 326-337.
Theobald, Martin (2006) Efficient Top-k Query Processing for Text, Semistructured, and Structured Data. Phd thesis, Saarland University.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2006) TopX & XXL at INEX 2005 (Ad-Hoc Track). In: 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, Lecture Notes in Computer Science, Springer, Vol. 3977, pp. 282-295.
Theobald, Martin and Broschart, Andreas and Schenkel, Ralf and Solomon, Silvana and Weikum, Gerhard (2006) TopX — AdHoc Track and Feedback Task. In: 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, pp. 140-149.

2005

Ifrim, Georgiana and Theobald, Martin and Weikum, Gerhard (2005) Learning Word-to-Concept Mappings for Automatic Text Classification. In: 22nd International Conference on Machine Learning, pp. 18-26.
Mavroeidis, Dimitrios and Tsatsaronis, George and Vazirgiannis, Michalis and Theobald, Martin and Weikum, Gerhard (2005) Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification. In: 9th European Conference on Principles and Practice of Knowledge Discovery in Databases Knowledge discovery in databases (PKDD 2005) , Lecture Notes in Computer Science, Springer, Vol. 3721, pp. 181-192.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2005) An Efficient and Versatile Query Engine for TopX Search. In: 31st International Conference on Very Large Data Bases (VLDB 2005), pp. 625-636.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2005) Efficient and Self-Tuning Incremental Query Expansion for Top-k Query Processing. In: 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 242-249.

2004

Graupmann, Jens and Biwer, Michael and Zimmer, Christian and Zimmer, Patrick and Bender, Matthias and Theobald, Martin and Weikum, Gerhard (2004) COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data. In: 30th International Conference on Very Large Databases (VLDB 2004), Morgan Kaufmann, pp. 1313-1316.
Theobald, Martin and Weikum, Gerhard and Schenkel, Ralf (2004) Top-k Query Evaluation with Probabilistic Guarantees. In: 30th International Conference on Very Large Databases (VLDB 2004), pp. 648-659.
Theobald, Martin and Klas, Claus-Peter (2004) BINGO! and Daffodil: Personalized Exploration of Digital Libraries and Web Sources. In: RIAO 2004, pp. 347-365.
Weikum, Gerhard and Graupmann, Jens and Schenkel, Ralf and Theobald, Martin (2004) Towards a Statistically Semantic Web. In: 23rd International Conference on Conceptual Modeling (ER 2004), Lecture Notes in Computer Science, Springer, Vol. 3288, pp. 3-17.

2003

Graupmann, Jens and Sizov, Sergej and Theobald, Martin (2003) From Focused Crawling to Expert Information: an Application Framework for Web Exploration and Portal Generation. In: 29th International Conference on Very Large Data Bases (VLDB 2003), Morgan Kaufmann, pp. 1105-1108.
Sizov, Sergej and Theobald, Martin and Siersdorfer, Stefan and Weikum, Gerhard and Graupmann, Jens and Biwer, Michael and Zimmer, Patrick (2003) The BINGO! System for Information Portal Generation and Expert Web Search. In: Conference on Innovative Data Systems Research (CIDR 2003), 5-8 January 2003, pp. 69-80.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2003) Classification and Focused Crawling for Semistructured Data. Lecture Notes in Computer Science, Springer, In: Intelligent Search on XML Data: Applications, Languages, Models, Implementations, and Benchmarks. Vol. 2818, pp. 145-157.
Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2003) Exploiting Structure, Annotation, and Ontological Knowledge for Automatic Classification of XML Data. In: 6th International Workshop on the Web and Databases, pp. 1-6.

2002

Sizov, Sergej and Theobald, Martin and Siersdorfer, Stefan and Weikum, Gerhard (2002) BINGO!: Bookmark-Induced Gathering of Information.. In: 3rd International Conference on Web Information Systems Engineering, IEEE Computer Society, pp. 323-332.
Sizov, Sergej and Siersdorfer, Stefan and Theobald, Martin and Weikum, Gerhard (2002) The BINGO! Focused Crawler: From Bookmarks to Archetypes.. In: 18th International Conference on Data Engineering (ICDE 2002), IEEE Computer Society, pp. 337-338.
Theobald, Martin and Siersdorfer, Stefan and Sizov, Sergej (2002) BINGO! Ein thematisch fokussierender Crawler zur Generierung personalisierter Ontologien. In: 32. Jahrestagung der Gesellschaft für Informatik (GI 2002), Vol. 19, pp. 146-150.

Kontakt

Prof. Dr. Martin Theobald

Stellv. Institutsdirektor

martin.theobald(at)uni-ulm.de