如何对带有RTL的文档使用WordToHtmlConverter

我正在尝试转换.doc文件(和docx文件,但此软件包不支持) 使用org.apache.poi.hwpf.converter.wordToHtmlConvertor https://poi.apache.org/apidocs/dev/org/apache/poi/hwpf/converter/WordToHtmlConverter.html

所有文本在输出HTML中的格式设置为LTR。有没有办法格式化文字方向?

下面使用的代码: 谢谢。

package docConverter;

import java.io.ByteArrayOutputStream;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;

import org.apache.poi.hwpf.HWPFDocumentCore;
import org.apache.poi.hwpf.converter.WordToHtmlConverter;
import org.apache.poi.hwpf.converter.WordToHtmlUtils;
import org.w3c.dom.Document;


public class docConverter {
    public static void main(String[] args)throws Exception
    {
        HWPFDocumentCore wordDocument = WordToHtmlUtils.loadDoc(new FileInputStream("data/test1.doc"));

        WordToHtmlConverter wordToHtmlConverter = new WordToHtmlConverter(
                DocumentBuilderFactory.newInstance().newDocumentBuilder()
                        .newDocument());
        wordToHtmlConverter.processDocument(wordDocument);
        Document htmlDocument = wordToHtmlConverter.getDocument();

        ByteArrayOutputStream out = new ByteArrayOutputStream();
        DOMSource domSource = new DOMSource(htmlDocument);
        StreamResult streamResult = new StreamResult(out);

        TransformerFactory tf = TransformerFactory.newInstance();
        Transformer serializer = tf.newTransformer();
        serializer.setOutputProperty(OutputKeys.ENCODING,"UTF-8");
        serializer.setOutputProperty(OutputKeys.INDENT,"yes");
        serializer.setOutputProperty(OutputKeys.METHOD,"html");

        serializer.transform(domSource,streamResult);
        out.close();

        FileOutputStream outputStream = new FileOutputStream("data/test1.html");
        outputStream.write(out.toByteArray());
        outputStream.close();

        System.out.println("Apache - Doc file converted in specified formats");
    }
}
dong2apple 回答:如何对带有RTL的文档使用WordToHtmlConverter

暂时没有好的解决方案,如果你有好的解决方案,请发邮件至:iooj@foxmail.com
本文链接:https://www.f2er.com/2446894.html

大家都在问