FACT aims to develop software that automatically enriches a corpus of Dutch folktales with metadata such as names, keywords, genre, a summary, and a catalogue number (or folktale type).
The research will focus on whether automatic corpus analysis can offer new possibilities for classifying folktales by means of clustering. The classification algorithms developed in the project will be integrated into user-friendly tools that support annotation and exploratory research of the folktale corpus. With these tools, researchers can study variability in oral and written transmission, as well as the strengths and weaknesses of human classification compared with computerized clustering.
Official project website: http://www.elab-oralculture.nl/fact