

4th International Workshop on Text-based Information Retrieval
in conjunction with DEXA 2007
Regensburg, Germany

Workshop Program

Monday, September 3rd, 2007

Session 1: Advanced Text Analysis and Document Models
09:30-10:00 Author Identification Using Imbalanced and Limited Training Texts
E. Stamatatos [paper] [slides]
10:00-10:30 Classifying XML Documents by Using Genre Features
M. Clark, S. Watt [paper] [slides]
10:30-11:00 Generating a Topic Hierarchy from Dialect Texts
W. De Smet, M.-F. Moens [paper]
11:00-11:30 Coffee break
Session 2: Software Engineering and Applications
11:30-12:00 A System for Summary-Document Evaluation in Notary Domain
C. Cesarano, A. Mazzeo and A. Picariello [paper]
12:00-12:30 An MDA Approach to Implement Personal IR Tools
S. Meyer zu Eissen, B. Stein [paper] [slides]
Session 3: Semantic Web and Annotation
14:00-14:30 Aspects of Broad Folksonomies
M. Lux, M. Granitzer, R. Kern [paper] [slides]
14:30-15:00 Apolda: A Practical Tool for Semantic Annotation
C. Wartena, R. Brussee, L. Gazendam, W.-O. Huijsen [paper] [slides]
15:00-15:30 Automatic Annotation for Korean - Approach Based on the Contextual Exploration Method
H. Chai [paper]
15:30-16:00 Coffee break
Session 4: Retrieval and Ranking
16:00-16:30 Regression Relevance Models for Data Fusion
S. Wu [paper] [slides]
16:30-17:00 Web Page Scoring based on a Link Analysis of Web Page Sets
H. Nakakubo, S. Nakajima, K. Hatano, J. Miyazaki, S. Uemura [paper] [slides]
17:00-17:30 Hyperlink Classification: A New Approach to Improve PageRank
L. Cun-he, L. Ke-qiang [paper]

Call for Papers

Intelligent technologies for information mining and retrieval have become an important and exciting field of research in our information-flooded society. Methods of text-based information retrieval receive special attention because of the fundamental role of written text, the wide availability of the Internet, and the rising importance of the different forms of Web communities.

Text-based information retrieval tasks draw on techniques and methods from different research areas: machine learning, computational linguistics and psychology, user interaction and modeling, information visualization, Web engineering, and distributed systems. Developing powerful retrieval tools requires combining these developments, and in this sense the workshop shall provide a platform that spans different views and approaches.

The following list gives examples of classic and ongoing topics in the field of text-based information retrieval for which contributions are welcome; submissions are not restricted to these topics:

  • formal models for text representation, document models, similarity measures for special retrieval tasks
  • category formation and clustering, document classification
  • IR and natural language processing: topic identification, text summarization, keyword extraction
  • Web community mining, social network analysis, collaborative tagging and IR
  • plagiarism analysis, author identification, style analysis
  • concepts and techniques for information visualization, user modeling, and interaction for particular retrieval tasks
  • relevance feedback and personalization
  • evaluation, building of test collections, experimental design and user studies
  • multilingual issues in IR: cross-language retrieval, multilingual retrieval, machine translation for IR
  • IR for the Semantic Web: usage, extraction, and maintenance of knowledge
  • IR and software engineering: frameworks, architectures, distributed IR
  • IR in business and engineering applications

The workshop addresses researchers, users, and practitioners from different fields: data mining and machine learning, document and knowledge management, semantic technologies, computational linguistics, and information retrieval in general. In particular, we encourage potential participants to present research prototypes and tools that demonstrate their ideas.

Submission Guidelines

Research papers submitted to TIR-07 must be original contributions. Contributions will be peer-reviewed by at least two experts from the related field. Accepted papers will be published as IEEE proceedings by IEEE CS Press.

Submissions must generally be in electronic form using the Portable Document Format (PDF) or PostScript. It is the responsibility of authors to ensure that their papers use no unusual format features and are printable on a standard PostScript printer.

Styleguide and Author Information

The camera-ready papers are limited to 5 (five) pages in IEEE format (two columns in A4).

If your paper is accepted you will receive an "Author Kit" (formatting instructions etc.) from the IEEE in the first week of May 2007. Papers must be written in English. At least one author of each accepted paper must register for the conference and present the contribution.

Submit a Contribution

Please send your paper to

Important Dates

  • Apr 16, 2007, 24:00 (CET): Deadline for paper submission
  • Apr 28, 2007: Notification to authors
  • May 31, 2007: Camera-ready copy due
  • Sep 03, 2007: TIR-07 Workshop

Organizing Committee

  • Benno Stein (Co-Chair), Bauhaus University Weimar
  • Harald Sack (Co-Chair), University of Jena

Program Committee

  • Mikhail Alexandrov, Autonomous University of Barcelona
  • Elizabeth Sugar Boese, Colorado State University
  • Alexander Gelbukh, National Polytechnic Institute (IPN), Mexico
  • Michael Granitzer, Know-Center Graz
  • Udo Hahn, University of Jena
  • Siegfried Handschuh, National University of Ireland
  • Heiko Holzheuer, Lycos Europe
  • Andreas Hotho, University of Kassel
  • Wolfgang Kienreich, Know-Center Graz
  • Mathias Lux, University of Klagenfurt
  • Thomas Mandl, University of Hildesheim
  • Sven Meyer zu Eissen, Bauhaus University Weimar
  • Martin Potthast, Bauhaus University Weimar
  • Marina Santini, University of Brighton
  • Efstathios Stamatatos, University of the Aegean