| Key | Summary | Assignee | Reporter | Status | Resolution |
| --- | --- | --- | --- | --- | --- |
| TIKA-2424 | Don't include ml model .bin files in src.zip | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2422 | Improve detection of Graphviz *.dot format | Unassigned | Sebastian Nagel | Resolved | Fixed |
| TIKA-2418 | English ASCII text classified as video/quicktime | Unassigned | Christopher Creutzig | Resolved | Fixed |
| TIKA-2416 | Upgrade dependencies in tika-eval | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2414 | Upgrade gson to 2.8.1 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2413 | Upgrade mime4j to 0.8.1 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2411 | Clean up tika-bundle | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2410 | RTF parser is tagging non-bold text as bold | Unassigned | Dave Kincaid | Resolved | Fixed |
| TIKA-2408 | ZipException in text extraction from DOCX file | Unassigned | Jorge Spinsanti | Resolved | Workaround |
| TIKA-2405 | SAXParseException in text extraction from DOCX file | Unassigned | Jorge Spinsanti | Resolved | Workaround |
| TIKA-2404 | XMLException in DOCX->TXT conversion | Unassigned | Jorge Spinsanti | Resolved | Workaround |
| TIKA-2399 | Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000 | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2391 | Extract <script> elements in html as "attachment" type MACRO like we do in the PDFParser | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2388 | Problem in Tika().detect for ODB (Open Office database) files | Unassigned | Alessandro Scaldaferro | Resolved | Fixed |
| TIKA-2386 | Improve digest options | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2384 | Double close of InputStream in accept text/plain in tika-server | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2380 | Upgrade to Jackcess 2.1.8 when available | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2379 | tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exist | Bob Paulin | Claus Ibsen | Resolved | Fixed |
| TIKA-2378 | Error extracting text from application/x-msaccess mime type | Unassigned | Steve Reynolds | Resolved | Fixed |
| TIKA-2377 | Remove org.json from TEIParser (parent: TIKA-1804) | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2376 | Avoid org.json dependency | Unassigned | Claus Ibsen | Resolved | Fixed |
| TIKA-2374 | Tika App -z should extract PDF inline images by default | Unassigned | Nick Burch | Reopened | Unresolved |
| TIKA-2341 | Upgrade to commons compress 1.14 when available | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2336 | Upgrade to POI 3.17-beta1 when available | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2335 | Extract path info from Excel 2013 .xlsx and .xlsb | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2298 | Improve object recognition parser so that it can work without an external RESTful service setup | Chris A. Mattmann | Avtar Singh | Resolved | Fixed |
| TIKA-2254 | Provide chart support for MS Office documents | Tim Allison | Chris Bamford | Resolved | Fixed |
| TIKA-2201 | OutOfMemoryError on a reasonably sized document | Unassigned | Seva Alekseyev | Resolved | Workaround |
| TIKA-2089 | Macros not extracted from ppt files | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-1945 | PowerPoint parser doesn't extract text from diagrams | Tim Allison | Nick C | Resolved | Fixed |
| TIKA-1804 | Tika uses non-free json.org | Unassigned | gil cattaneo | Resolved | Fixed |
| TIKA-1106 | CLAVIN Integration | Chris A. Mattmann | Adam Estrada | Resolved | Won't Fix |