When I wrote my last blog post 16 months ago I had not imagined to be part of the intergator team for so long. Nevertheless, following my internship I was not only offered a student job, but after that the opportunity to write my diploma thesis here. What had been the focus of my internship assignment[…]
Today’s customers want to use powerful search engines for their huge and increasing content repositories.
Full-text-only products with simple result lists are not enough to satisfy this community.
Different content sources require different analyzing and indexing strategies and a content-specific result presentation. There is a lot of research in the field of the application of semantic web technologies for information retrieval.
A wide range of useful standard vocabularies and powerful frameworks has been developed that can be used to gather, transform and store metadata. However, in practice we see a gap between state-of-the-art information retrieval and customer needs with a defined price-performance relation.
It is quite a challenge to index a large file server with heterogeneous content annotated with metadata from different vocabularies to provide an ontology-based navigation, to produce semantic annotated search results, to use faceted browsers as powerful filtering mechanism and do that with an out-of-the-box solution, which is stable, has a good performance and provides a simple way to configure it.