ryoakg
10/14/2015 - 5:18 PM

Tikaでrtf読む.文字の色とか大きさは取れなかった.残念

Tikaでrtf読む.文字の色とか大きさは取れなかった.残念

;;; `boot repl` to go
(set-env! :dependencies '[[org.apache.tika/tika-parsers "1.10"]])
(import '(org.apache.tika metadata.Metadata
                          parser.ParseContext
                          parser.rtf.RTFParser)
        '(java.io StringWriter FileInputStream)
        '(javax.xml.transform sax.SAXTransformerFactory
                              stream.StreamResult
                              OutputKeys))

(let [metadata (Metadata.)
      sw (StringWriter.)
      factory (cast SAXTransformerFactory (SAXTransformerFactory/newInstance))
      handler (doto (.newTransformerHandler factory)
                (.. getTransformer (setOutputProperty OutputKeys/METHOD "xml"))
                (.. getTransformer (setOutputProperty OutputKeys/INDENT "no"))
                (.setResult (StreamResult. sw)))]
  (. (RTFParser. ) parse (FileInputStream. "1.rtf") handler metadata (ParseContext.))
  (-> sw .toString prn)
  )