![Tika Tika! Getting started doing OCR with Apache Tika andTesseract from the JVM (Scala, Java, Kotlin…). | by Nathan Perdijk | Codestar blog | Medium Tika Tika! Getting started doing OCR with Apache Tika andTesseract from the JVM (Scala, Java, Kotlin…). | by Nathan Perdijk | Codestar blog | Medium](https://miro.medium.com/v2/resize:fit:1200/1*P93TyWUqN0foFjmMaB0ehQ.jpeg)
Tika Tika! Getting started doing OCR with Apache Tika andTesseract from the JVM (Scala, Java, Kotlin…). | by Nathan Perdijk | Codestar blog | Medium
Apache Tika can not parse Microsoft Docx format in native mode · Issue #6549 · quarkusio/quarkus · GitHub
![Apache Tika do not extract first line of the RTF file, It only extract last three char of first line. - Stack Overflow Apache Tika do not extract first line of the RTF file, It only extract last three char of first line. - Stack Overflow](https://i.stack.imgur.com/cJYuQ.png)