What is a full text index ?
A full text index is an index of which words occur in a particular body of text, in this case papers dealing with Myxomycota. Full text indexes are well-known from the internet. Search engines, such as Google and Bing are in essence fultext indexes of words occuring in internet pages. Contrary to these search engines, this index only holds information from Scientific or popular papers on Myxomycota resulting in search results with less irrelevant hits.Which papers have been included ?
Schnittler & Mitchell (2000) estimated the number of publications focusing on Myxomycete to be approximately 3000. They explicitly pointed out, that this was a conservative estimate. Since then the number of papers have virtually exploded.Consequently the database cannot be considered anything near complete. New records are added regularly, but still some criterium of priority have to be used. As more recent literature is easier to find with the internet search engines, priority is given to older papers, however, without excluding new papers in any way. At present more than 3600 papers have been indexed. What is indexed ?
The index is a very simple one. The principles on which it is based, are the following:
- Only words with more than 3 characters have been indexed. This will inevitably sort out (few) important words. However, a limit is nescessary in order to avoid vasting resources on a huge number of trivial, small words. Selected three-letter words of particular importance have been included, however.
- The character set employed is unicode UTF-8. The original spelling of the papers has been preserved, however it has been attempted to 'normalize' accented characters in order to make it possible to search without accents. Thus f.ex., the city of 'Genève' may be found searching with an accented 'è' as well as with a non-accented 'e'.
- In order to make 'non-accented' searches substitute the following characters with with their two letter equivalents: æ and ä = ae, ø and ö = oe, and å = aa. Substitute all other accented characters with their one letter equivalent, f.ex. ç = c.
Most of the papers included in this project are protected by copyright. This means that the entire text of a particular paper cannot be published without the (written) consent from the copyright holders, i.e. author(s) and/or publisher.
The copyright rules applying to this website, are those of the European Union directives as adopted in Danish legislation. In short, a work (a paper in this case) is protected by copyright 70 years after the death of the author. In the case that a paper has more than one author, the rule applies to the author, who lives longest.
The rules thus allow the publication of papers no longer protected by copyright. In order to obey the rules it would be necessary to maintain records of authors dates. The datamodel, on which this search engine is based, have been prepared to handle such data, and a very small number of papers are already available as full text. More will be included as authors data have been recorded. This, however, has low priority.
The basic principle is that this is a full text indexing service - not a full text delivery service.Acknowledgements
H. Dissing † and N. H. Larsen, formerly Institute of Biology, and H. Knudsen and T. Læssøe, Natural History Museum, University of Copenhagen, have generously given me access to their private libraries and reprint collections. Their help is gratefully acknowledged.