University of Borås

Borås Academic Digital Archive (BADA) >
Forskningspublikationer / Research Publications >
Institutionen Biblioteks- och informationsvetenskap / Swedish School of Library and Information Science (BHS) >
Konferensbidrag / Conference papers (BHS) >

Please use this identifier to cite or link to this item: http://hdl.handle.net/2320/6609

Files in This Item:

File Description SizeFormat
Lendvaietal_DH2010_final.pdf208.58 kBAdobe PDFView/Open
Title: Propp Revisited: Integration of Linguistic Markup into Structured Content Descriptors of Tales
Authors: Lendvai, Piroska
Declerck, Thierry
Darányi, Sándor
Malec, Scott
Department: University of Borås. Swedish School of Library and Information Science
Issue Date: 7-Jul-2010
Journal Title: Proceedings of the Conference for Digital Humanities 2010
Media type: text
Publication type: conference paper, peer reviewed
Subject Category: Subject categories::HUMANITIES and RELIGION::Languages and linguistics::Linguistic subjects::Language technology
Subject categories::HUMANITIES and RELIGION::Other humanities and religion
Subject categories::INTERDISCIPLINARY RESEARCH AREAS::Cultural heritage and cultural production
Research Group: Digital Humanities Research Group
Area of Research: folktale analysis, language technology, semantic markup, motif analysis
Abstract: Metadata that serve as semantic markup, such as conceptual categories that describe the macrostructure of a plot in terms of actors and their mutual relationships, actions, and their ingredients annotated in folk narratives, are important additional resources of digital humanities research. Traditionally originating in structural analysis, in fairy tales they are called functions (Propp, 1968), whereas in myths – mythemes (Lévi-Strauss, 1955); a related, overarching type of content metadata is a folklore motif (Uther, 2004; Jason, 2000).In his influential study, Propp treated a corpus of tales in Afanas'ev's collection (Afanas'ev, 1945), establishing basic recurrent units of the plot ('functions'), such as Villainy, Liquidation of misfortune, Reward, or Test of Hero, and the combinations and sequences of elements employed to arrange them into moves.1 His aim was to describe the DNAlike structure of the magic tale sub-genre as a novel way to provide comparisons. As a start along the way to developing a story grammar, the Proppian model is relatively straightforward to formalize for computational semantic annotation, analysis, and generation of fairy tales. Our study describes an effort towards creating a comprehensive XML markup of fairy tales following Propp's functions, by an approach that integrates functional text annotation with grammatical markup in order to be used across text types, genres and languages. The Proppian fairy tale Markup Language (PftML) (Malec, 2001) is an annotation scheme that enables narrative function segmentation, based on hierarchically ordered textual content objects. We propose to extend PftML so that the scheme would additionally rely on linguistic information for the segmentation of texts into Proppian functions. Textual variation is an important phenomenon in folklore, it is thus beneficial to explicitly represent linguistic elements in computational resources that draw on this genre; current international initiatives also actively promote and aim to technically facilitate such integrated and standardized linguistic resources. We describe why and how explicit representation of grammatical phenomena in literary models can provide interdisciplinary benefits for the digital humanities research community. In two related fields of activities, we address the above as part of our ongoing activities in the CLARIN2 and AMICUS3 projects. CLARIN aims to contribute to humanities research by creating and recommending effective workflows using natural language processing tools and digital resources in scenarios where text-based research is conducted by humanities or social sciences scholars. AMICUS is interested in motif identification, in order to gain insight into higher-order correlations of functions and other content units in texts from the cultural heritage and scientific discourse domains. We expect significant synergies from their interaction with the PftML prototype.
URI: http://hdl.handle.net/2320/6609
Appears in Collections:Konferensbidrag / Conference papers (BHS)

SFX Query

All items in Borås Academic Digital Archive are protected by copyright, with all rights reserved.

 

DSpace Software Copyright © 2002-2010  The DSpace Foundation