一、问题描述

 

有个需求就是读取word文档里的内容,使用到了poi这个包,代码如下:

    /**
     * 读取doc文件内容
     *
     * @param fs 想要读取的文件对象
     * @return 返回文件内容
     * @throws IOException
     */
    public static String doc2String(BufferedInputStream fs) throws IOException {
        String text = "";
        if (FileMagic.valueOf(fs) == FileMagic.OLE2) {
            WordExtractor ex = new WordExtractor(fs);
            text = ex.getText();
            ex.close();
        } else if (FileMagic.valueOf(fs) == FileMagic.OOXML) {
            XWPFDocument doc = new XWPFDocument(fs);
            XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
            text = extractor.getText();
            extractor.close();
        }
        return text;
    }

    public static String doc2String(File file) throws IOException {
        return doc2String(new BufferedInputStream(new FileInputStream(file)));
    }

    public static void main(String[] args) {
        File file = new File("D:\\xxx\\xxx\\1\\file\\2021\\04\\28\\34a58ac4faa4222712a4329ac60f34f9\\34a58ac4faa4222712a4329ac60f34f9.docx");
        try {
            System.out.println(doc2String(file));
        } catch (IOException e) {
            e.printStackTrace();
        }
    }

运行报错:

Exception in thread "main" java.lang.NoSuchMethodError: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTRImpl.getXmlObjectArray(Ljavax/xml/namespace/QName;[Lorg/apache/xmlbeans/XmlObject;)[Lorg/apache/xmlbeans/XmlObject;
	at org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTRImpl.getDrawingArray(CTRImpl.java:3979)
	at org.apache.poi.xwpf.usermodel.XWPFRun.<init>(XWPFRun.java:96)
	at org.apache.poi.xwpf.usermodel.XWPFRun.<init>(XWPFRun.java:146)
	at org.apache.poi.xwpf.usermodel.XWPFParagraph.buildRunsInOrderFromXml(XWPFParagraph.java:118)
	at org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(XWPFParagraph.java:67)
	at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:178)
	at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:169)
	at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:126)

 

二、解决方法

 

找不到org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTRImpl.getXmlObjectArray类,出现这种原因,肯定是少引入了相关的jar或者版本错误导致的,像我这里就是错误的引入了包导致的:

    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi-ooxml-schemas</artifactId>
      <version>4.1.2</version>
    </dependency>

我这里引入的是poi-ooxml-schemas,这个包是个精简过的,所以有些类没有,官方的说明如下:http://poi.apache.org/help/faq.html

引入poi-ooxml包就行了,完整依赖如下:

    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi</artifactId>
      <version>5.0.0</version>
    </dependency>

    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi-ooxml</artifactId>
      <version>5.0.0</version>
    </dependency>
    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi-scratchpad</artifactId>
      <version>5.0.0</version>
    </dependency>
<!--    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi-ooxml-schemas</artifactId>
      <version>4.1.2</version>
    </dependency>-->
    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi-ooxml-full</artifactId>
      <version>5.0.0</version>
    </dependency>
    <dependency>
      <groupId>org.apache.poi</groupId>
      <artifactId>poi</artifactId>
      <version>5.0.0</version>
    </dependency>

 

Logo

GitCode 天启AI是一款由 GitCode 团队打造的智能助手,基于先进的LLM(大语言模型)与多智能体 Agent 技术构建,致力于为用户提供高效、智能、多模态的创作与开发支持。它不仅支持自然语言对话,还具备处理文件、生成 PPT、撰写分析报告、开发 Web 应用等多项能力,真正做到“一句话,让 Al帮你完成复杂任务”。

更多推荐