public class XWPFWordExtractor extends Object implements POIXMLTextExtractor
| Modifier and Type | Field and Description |
|---|---|
static List<XWPFRelation> |
SUPPORTED_TYPES |
| Constructor and Description |
|---|
XWPFWordExtractor(OPCPackage container) |
XWPFWordExtractor(XWPFDocument document) |
| Modifier and Type | Method and Description |
|---|---|
void |
appendBodyElementText(StringBuilder text,
IBodyElement e) |
void |
appendParagraphText(StringBuilder text,
XWPFParagraph paragraph) |
XWPFDocument |
getDocument()
Returns opened document
|
XWPFDocument |
getFilesystem() |
String |
getText() |
boolean |
isCloseFilesystem() |
void |
setCloseFilesystem(boolean doCloseFilesystem) |
void |
setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns)
Should we concatenate phonetic runs in extraction.
|
void |
setFetchHyperlinks(boolean fetch)
Should we also fetch the hyperlinks, when fetching
the text content? Default is to only output the
hyperlink label, and not the contents
|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitcheckMaxTextSize, close, getCoreProperties, getCustomProperties, getExtendedProperties, getMetadataTextExtractor, getPackagepublic static final List<XWPFRelation> SUPPORTED_TYPES
public XWPFWordExtractor(OPCPackage container) throws IOException
IOExceptionpublic XWPFWordExtractor(XWPFDocument document)
public void setFetchHyperlinks(boolean fetch)
public void setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns)
trueconcatenatePhoneticRuns - If phonetic runs should be concatenatedpublic String getText()
getText in interface POITextExtractorpublic void appendBodyElementText(StringBuilder text, IBodyElement e)
public void appendParagraphText(StringBuilder text, XWPFParagraph paragraph)
public XWPFDocument getDocument()
POIXMLTextExtractorgetDocument in interface POITextExtractorgetDocument in interface POIXMLTextExtractorpublic void setCloseFilesystem(boolean doCloseFilesystem)
setCloseFilesystem in interface POITextExtractorpublic boolean isCloseFilesystem()
isCloseFilesystem in interface POITextExtractorpublic XWPFDocument getFilesystem()
getFilesystem in interface POITextExtractor