public class StrExtractor
extends java.lang.Object
Constructor and Description |
---|
StrExtractor() |
Modifier and Type | Method and Description |
---|---|
static java.lang.String | getDOCContent(com.seeyon.ctp.common.file.model.CtpFile file) |
static java.lang.String | getHTMLContent(java.lang.String content) |
static java.lang.String | getHtmlORTxtContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file) |
static java.lang.String | getOfficeContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file, java.lang.String... mimeType) Supports Office 2003 (Word, Excel, PowerPoint, Visio) and Office 2007 (Word, Excel, PowerPoint, Visio). To parse Excel, pass the mimeType parameter: IFileParser.MIME_XLS for 2003-format Excel, IFileParser.MIME_XLSX for 2007-format Excel. |
static java.lang.String | getPDFContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file) |
static java.lang.String | getPPTContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file) |
static java.lang.String | getRTFContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file) |
static java.lang.String | getText(java.lang.String bodyType, java.lang.String bodyContent, java.util.Date bodyCreateDate) |
static java.lang.String | getTXTContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file) |
static java.lang.String | getV3XFileContent(V3XFile v3xFile) Parses a V3XFile into a string. Currently supported types: text/plain, application/msword, application/vnd.ms-excel, application/vnd.ms-powerpoint, application/vnd.openxmlformats-officedocument.wordprocessingml.document, application/vnd.openxmlformats-officedocument.spreadsheetml.sheet, application/vnd.openxmlformats-officedocument.presentationml.presentation, application/vnd.visio |
static java.lang.String | getVisioContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file) |
static java.lang.String | getWpsExcelContent(com.seeyon.ctp.common.file.model.CtpFile file) Parses a WPS Excel document. |
static java.lang.String | getWpsWordContent(com.seeyon.ctp.common.file.model.CtpFile file) Parses a WPS Word document. |
static java.lang.String | getXLSContent(com.seeyon.ctp.common.file.model.CtpFile file, java.lang.String... mimeType) |
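
The getOfficeContent entry says that Excel files need an explicit mimeType argument. The sketch below shows one way a caller might pick that argument from a file name. The helper class, its method, and the extension-based rule are hypothetical, and the two constants are stand-ins for IFileParser.MIME_XLS and IFileParser.MIME_XLSX, whose real values are not given on this page (the standard MIME strings are assumed).

```java
import java.util.Locale;

// Hypothetical helper, not part of the documented API.
class ExcelMimeArgSketch {
    // Stand-ins for IFileParser.MIME_XLS / IFileParser.MIME_XLSX (values assumed).
    static final String MIME_XLS = "application/vnd.ms-excel";
    static final String MIME_XLSX =
            "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet";

    /** Returns the varargs to pass as mimeType: one entry for Excel files, none otherwise. */
    static String[] mimeArgsFor(String fileName) {
        String lower = fileName.toLowerCase(Locale.ROOT);
        if (lower.endsWith(".xlsx")) {
            return new String[] { MIME_XLSX };
        }
        if (lower.endsWith(".xls")) {
            return new String[] { MIME_XLS };
        }
        // Word, PowerPoint and Visio files need no mimeType according to the table.
        return new String[0];
    }

    public static void main(String[] args) {
        // With the real library on the classpath, the call would presumably look like:
        //   String text = StrExtractor.getOfficeContent(file, mimeArgsFor(name));
        System.out.println(String.join(",", mimeArgsFor("report.xlsx")));
    }
}
```

Because mimeType is a varargs parameter, passing an empty array is equivalent to omitting the argument, so the same call site can serve both Excel and non-Excel files.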
public static java.lang.String getText(java.lang.String bodyType, java.lang.String bodyContent, java.util.Date bodyCreateDate) throws UnknowBodyTypeException
public static java.lang.String getWpsExcelContent(com.seeyon.ctp.common.file.model.CtpFile file)
Parameters: file
public static java.lang.String getWpsWordContent(com.seeyon.ctp.common.file.model.CtpFile file)
Parameters: file
public static java.lang.String getHTMLContent(java.lang.String content)
public static java.lang.String getPDFContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file)
public static java.lang.String getDOCContent(com.seeyon.ctp.common.file.model.CtpFile file)
public static java.lang.String getRTFContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file)
public static java.lang.String getXLSContent(com.seeyon.ctp.common.file.model.CtpFile file, java.lang.String... mimeType)
public static java.lang.String getTXTContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file)
public static java.lang.String getHtmlORTxtContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file)
public static java.lang.String getPPTContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file)
public static java.lang.String getVisioContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file)
public static java.lang.String getOfficeContent(com.seeyon.ctp.common.file.model.CtpAbstractFile file, java.lang.String... mimeType)
Parameters: file
public static java.lang.String getV3XFileContent(V3XFile v3xFile)
Currently supported types: text/plain, application/msword, application/vnd.ms-excel, application/vnd.ms-powerpoint, application/vnd.openxmlformats-officedocument.wordprocessingml.document, application/vnd.openxmlformats-officedocument.spreadsheetml.sheet, application/vnd.openxmlformats-officedocument.presentationml.presentation, application/vnd.visio
Parameters: v3xFile
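
Since getV3XFileContent only handles the MIME types listed above, a caller may want to check the type before invoking it. The following is a minimal sketch of such a pre-check. The class and method names are hypothetical, and this page does not say how the real method behaves for an unsupported type, so the guard is only an illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical pre-check helper; not part of StrExtractor.
class V3xMimeSupportSketch {
    // The eight MIME types listed for getV3XFileContent on this page.
    static final Set<String> SUPPORTED = Collections.unmodifiableSet(new HashSet<>(Arrays.asList(
            "text/plain",
            "application/msword",
            "application/vnd.ms-excel",
            "application/vnd.ms-powerpoint",
            "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
            "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
            "application/vnd.openxmlformats-officedocument.presentationml.presentation",
            "application/vnd.visio")));

    /** True if the given MIME type is one of the documented types (case-insensitive). */
    static boolean isSupported(String mimeType) {
        return mimeType != null
                && SUPPORTED.contains(mimeType.trim().toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        // A caller would only hand the file to StrExtractor.getV3XFileContent
        // when this check passes.
        System.out.println(isSupported("application/msword"));
        System.out.println(isSupported("application/pdf"));
    }
}
```

PDF and RTF are absent from the list even though StrExtractor exposes getPDFContent and getRTFContent, so files of those types would presumably go through the dedicated methods instead.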