This class defines an interlinearized text, which consists of a
collection of Paragraph objects.

__init__(self, file=None, fm_line='ref', fm_paragraph='id', fm_morpheme='m', fm_morpheme_gloss='g', fm_word='w')
Constructor for Text object.

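As a rough illustration of what the default field markers refer to, the sketch below splits a small Shoebox-style record into (marker, value) pairs. It is not the library's implementation: the `split_fields` helper and the sample data are hypothetical, and only the marker names ('id', 'ref', 'w', 'm', 'g') come from the constructor defaults above.

```python
import re

# Invented sample record. Only the marker names mirror the constructor
# defaults; the word, morpheme and gloss values are placeholders.
SAMPLE = r"""\id 1
\ref 1.1
\w word1 word2
\m mor1 -suf1 mor2
\g gloss1 -GL1 gloss2
\ref 1.2
\w word3
\m mor3
\g gloss3
"""


def split_fields(text):
    """Return a list of (marker, value) pairs, one per backslash-marked field."""
    fields = []
    for raw in text.splitlines():
        match = re.match(r"\\(\S+)\s*(.*)", raw)
        if match:
            fields.append((match.group(1), match.group(2).strip()))
        elif fields and raw.strip():
            # Assumption: an unmarked, non-empty line continues the previous field.
            marker, value = fields[-1]
            fields[-1] = (marker, (value + " " + raw.strip()).strip())
    return fields


if __name__ == "__main__":
    for marker, value in split_fields(SAMPLE):
        print(marker, "->", value)
```

Under these assumptions, 'id' opens a paragraph, 'ref' opens a line, and 'w', 'm' and 'g' carry the word, morpheme and morpheme gloss tiers of that line.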
get_lines(self)
Obtain a list of line objects (ignoring paragraph structure).

get_paragraphs(self)
Obtain a list of paragraph objects.

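The relationship between the two accessors can be sketched as follows: paragraphs hold lines, and the flat list of lines is the paragraphs' lines concatenated in order. This is a minimal sketch under assumptions, not the library's code. The dictionary layout and the `group_fields` and `flatten_lines` helpers are hypothetical; the real methods return Paragraph and line objects whose interfaces are not shown here.

```python
# Invented (marker, value) pairs standing in for a parsed Shoebox file.
FIELDS = [
    ("id", "1"),
    ("ref", "1.1"), ("w", "word1 word2"),
    ("ref", "1.2"), ("w", "word3"),
    ("id", "2"),
    ("ref", "2.1"), ("w", "word4"),
]


def group_fields(fields, fm_paragraph="id", fm_line="ref"):
    """Group fields into paragraphs, each holding a list of lines."""
    paragraphs = []
    for marker, value in fields:
        if marker == fm_paragraph:
            paragraphs.append({"id": value, "lines": []})
        elif marker == fm_line:
            if not paragraphs:
                # A line before any paragraph marker gets an unnamed paragraph.
                paragraphs.append({"id": None, "lines": []})
            paragraphs[-1]["lines"].append({"ref": value, "tiers": {}})
        elif paragraphs and paragraphs[-1]["lines"]:
            # Any other field is stored as a tier of the current line.
            paragraphs[-1]["lines"][-1]["tiers"][marker] = value
    return paragraphs


def flatten_lines(paragraphs):
    """Return every line in document order, ignoring paragraph boundaries."""
    return [line for paragraph in paragraphs for line in paragraph["lines"]]


if __name__ == "__main__":
    grouped = group_fields(FIELDS)
    print(len(grouped), "paragraphs,", len(flatten_lines(grouped)), "lines")
```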
getLineFM(self)
Get field marker that identifies a new line.

setLineFM(self, lineHeadFieldMarker)
Change default field marker that identifies new line.

getParagraphFM(self)
Get field marker that identifies a new paragraph.

setParagraphFM(self, paragraphHeadFieldMarker)
Change default field marker that identifies new paragraph.

getWordFM(self)
Get field marker that identifies word tier.

setWordFM(self, wordFieldMarker)
Change default field marker that identifies word tier.

getMorphemeFM(self)
Get field marker that identifies morpheme tier.

setMorphemeFM(self, morphemeFieldMarker)
Change default field marker that identifies morpheme tier.

getMorphemeGlossFM(self)
Get field marker that identifies morpheme gloss tier.

setMorphemeGlossFM(self, morphemeGlossFieldMarker)
Change default field marker that identifies morpheme gloss tier.

set_file(self, file)
Change file path set upon initialization.

parse(self)
Parse specified Shoebox file into Text object.

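The method names above suggest a call order: construct, adjust any field markers that differ from the defaults, point the object at a file, then parse. The sketch below walks through that order with a stand-in class. `SketchText` and `demo` are hypothetical and written only for illustration; the class borrows the documented method names but is not the documented Text class, and it keeps only line reference values rather than real line objects.

```python
import os
import tempfile


class SketchText:
    """Stand-in for illustration only; not the documented Text class."""

    def __init__(self, file=None, fm_line="ref", fm_paragraph="id"):
        self._file = file
        self._fm_line = fm_line
        self._fm_paragraph = fm_paragraph
        self._paragraphs = []

    def setLineFM(self, marker):
        self._fm_line = marker

    def setParagraphFM(self, marker):
        self._fm_paragraph = marker

    def set_file(self, file):
        self._file = file

    def parse(self):
        """Read the file and rebuild the paragraph/line structure."""
        self._paragraphs = []
        with open(self._file, encoding="utf-8") as handle:
            for raw in handle:
                if not raw.startswith("\\"):
                    continue
                marker, _, value = raw[1:].strip().partition(" ")
                if marker == self._fm_paragraph:
                    self._paragraphs.append([])
                elif marker == self._fm_line:
                    if not self._paragraphs:
                        self._paragraphs.append([])
                    self._paragraphs[-1].append(value)

    def get_paragraphs(self):
        return self._paragraphs

    def get_lines(self):
        return [line for para in self._paragraphs for line in para]


def demo():
    # Invented file that uses \p and \l instead of the default \id and \ref.
    content = "\\p 1\n\\l 1.1\n\\l 1.2\n\\p 2\n\\l 2.1\n"
    with tempfile.NamedTemporaryFile(
        "w", suffix=".txt", delete=False, encoding="utf-8"
    ) as tmp:
        tmp.write(content)
        path = tmp.name
    try:
        text = SketchText()
        text.setParagraphFM("p")  # markers must be set before parsing
        text.setLineFM("l")
        text.set_file(path)
        text.parse()
        return text.get_paragraphs(), text.get_lines()
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(demo())
```

In this sketch the markers are read at parse time, so changing them after parse() has no effect until parse() is called again. Whether the real class behaves the same way is an assumption.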
Inherited from corpora.toolbox.StandardFormat: close, fields, open, open_string, raw_fields
Inherited from object: __delattr__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __repr__, __setattr__, __str__