Uses of Interface
edu.upf.taln.dri.lib.model.Document
-
Packages that use Document Package Description edu.upf.taln.dri.lib edu.upf.taln.dri.lib.loader edu.upf.taln.dri.lib.model edu.upf.taln.dri.lib.model.util edu.upf.taln.dri.lib.model.util.serializer -
-
Uses of Document in edu.upf.taln.dri.lib
Methods in edu.upf.taln.dri.lib that return Document Modifier and Type Method Description static Document
Factory. createNewDocument()
Instantiates a newDocument
, without any data loaded.static Document
Factory. createNewDocument(File file)
Instantiates a newDocument
, populating it with the XML string-serialized contents read from a file.static Document
Factory. createNewDocument(String absoluteFilePath)
Instantiates a newDocument
, populating it with the XML string-serialized contents read from a file. -
Uses of Document in edu.upf.taln.dri.lib.loader
Methods in edu.upf.taln.dri.lib.loader that return Document Modifier and Type Method Description Document
JATSloader. parseJATS(byte[] JATSbyteArray, String JATSfileName)
Convert a JATS XML file to aDocument
Document
JATSloader. parseJATS(File file)
Convert a JATS XML file to aDocument
Document
JATSloader. parseJATS(String absoluteFilePath)
Convert a JATS XML file to aDocument
Document
JATSloader. parseJATS(URL url)
Convert a JATS XML file to aDocument
Document
JATSloaderImpl. parseJATS(byte[] JATSbyteArray, String JATSfileName)
Document
JATSloaderImpl. parseJATS(File file)
Document
JATSloaderImpl. parseJATS(String absoluteFilePath)
Document
JATSloaderImpl. parseJATS(URL url)
Document
GROBIDloaderImpl. parsePDF(byte[] PDFbyteArray, String PDFfileName)
Document
GROBIDloaderImpl. parsePDF(File file)
Document
GROBIDloaderImpl. parsePDF(String absoluteFilePath)
Document
GROBIDloaderImpl. parsePDF(URL url)
Document
PDFEXTloaderImpl. parsePDF(byte[] PDFbyteArray, String PDFfileName)
Document
PDFEXTloaderImpl. parsePDF(File file)
Document
PDFEXTloaderImpl. parsePDF(String absoluteFilePath)
Document
PDFEXTloaderImpl. parsePDF(URL url)
Document
PDFloader. parsePDF(byte[] PDFbyteArray, String PDFfileName)
Convert a PDF file to aDocument
Document
PDFloader. parsePDF(File file)
Convert a PDF file to aDocument
Document
PDFloader. parsePDF(String absoluteFilePath)
Convert a PDF file to aDocument
Document
PDFloader. parsePDF(URL url)
Convert a PDF file to aDocument
Document
PDFXloaderImpl. parsePDF(byte[] PDFbyteArray, String PDFfileName)
Document
PDFXloaderImpl. parsePDF(File file)
Document
PDFXloaderImpl. parsePDF(String absoluteFilePath)
Document
PDFXloaderImpl. parsePDF(URL url)
Document
PlainTextLoader. parsePlainText(File file)
Convert a plain text to aDocument
IMPORTANT: the text should be UTF-8 encodedDocument
PlainTextLoader. parsePlainText(String absoluteFilePath)
Convert a plain text to aDocument
IMPORTANT: the text should be UTF-8 encodedDocument
PlainTextLoader. parsePlainText(URL url)
Convert a plain text to aDocument
IMPORTANT: the text should be UTF-8 encodedDocument
PlainTextLoaderImpl. parsePlainText(File file)
Document
PlainTextLoaderImpl. parsePlainText(String absoluteFilePath)
Document
PlainTextLoaderImpl. parsePlainText(URL url)
Document
PlainTextLoader. parseString(String textToLoad, String docName)
Convert a plain text to aDocument
IMPORTANT: the text should be UTF-8 encodedDocument
PlainTextLoaderImpl. parseString(String textToLoad, String docName)
-
Uses of Document in edu.upf.taln.dri.lib.model
Classes in edu.upf.taln.dri.lib.model that implement Document Modifier and Type Class Description class
DocumentImpl
IMPORTANT: Never instantiate directly this class!
To get an instance of a Document by theDocument interface
, you have always to use one of theFactory
methods:
-Factory.createNewDocument()
-Factory.createNewDocument(String absoluteFilePath)
-Factory.createNewDocument(File file)
-Factory.getEmptyDocument()
-
Uses of Document in edu.upf.taln.dri.lib.model.util
Methods in edu.upf.taln.dri.lib.model.util with parameters of type Document Modifier and Type Method Description static String
RDFparse. getDocRDF(Document doc)
Generate RDF dataset from a documentstatic String
DocParse. getDocumentROSasCSVstring(Document doc, SentenceSelectorENUM sentenceSelector)
This method generates "NODE -> SUBJECT -> VERB_NODE" and "NODE -> OBJECT -> VERB_NODE" and "CAUSE_NODE -> CAUSE -> EFFECT_NODE" triples.
Because of coreference resolution, the non verb NODES (subjects and objects of SUBJECT / OBJECT triples) can represent the aggregation, performed by means of a coreference resolutor, of several nodes coming from several sentence.
In case a non verb NODE is the rsult of the aggregation of nodes by coreference resolutions, the text / name of the NODE is the sequence of the texts / names of the nodes aggregated, separated by three underscores (name1___name2___name3).static String
DocParse. getSentencesCSVstring(Document doc, SentenceSelectorENUM sentenceSelector)
This method will return a CSV with a number of rows equal to the number of sentences in the document.
The first CSV column is the document-unambiguous ID of the sentence as referred by the field NODE_SENT_ID of the ROS CSV of the ROS CSV generated by the method getDocumentROSasCSVstring(edu.upf.taln.dri.lib.model.Document doc, SentenceSelectorENUM sentenceSelector) of this class.
The second column is the space separated list of tokens of the sentence.
The third column is the Rhetorical Class of the sentence.
The fourth column is the name of the section that contains the sentence (if any).
The fifth column is the nesting level of the section that contains the sentence (if any).static String
DocParse. getTokenExpressionCSVstring(Document doc, SentenceSelectorENUM sentenceSelector)
This method will return a CSV withstatic String
DocParse. getTokenROSasCSVstring(Document doc, SentenceSelectorENUM sentenceSelector, DocGraphTypeENUM graphType)
Deprecated.IMPORTANT: this method is deprecated in favor of the methodgetDocumentROSasCSVstring
that generates a more compact and useful ROS -
Uses of Document in edu.upf.taln.dri.lib.model.util.serializer
Methods in edu.upf.taln.dri.lib.model.util.serializer with parameters of type Document Modifier and Type Method Description static String
JSONgenerator. getDocumentJSON(Document doc, boolean includeTokens, boolean includeBabelnetSynsets)
Get the JSON serialization of the information contained in the documentstatic String
JSONgenerator. getSentenceJSON(Document doc, Integer sentenceId, boolean includeTokens, boolean includeBabelnetSynsets)
Return the JSON serialization of a sentence of the document (JSON object), identified by its id.
The sentence JSON object is compliant to the structure modeled by theSentenceJSON
class.static String
JSONgenerator. getSentencesJSON(Document doc, SentenceSelectorENUM sentenceSel, boolean includeTokens, boolean includeBabelnetSynsets)
Return the JSON serialization of a set of sentences of the document.
A list of sentence JSON object is returned; the order of the elements of this list reflects the order of occurrence of each sentence in the document.
Each sentence JSON object is compliant with the class the structure modeled by theSentenceJSON
class.
-