ASF JIRA

Tika
1.16
Key descending
132 of 32 as at: 29/Mar/24 05:30
T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Improvement TIKA-2424

Don't include ml model .bin files in src.zip

Unassigned Tim Allison Blocker Resolved Fixed  
Improvement TIKA-2422

Improve detection of Graphviz *.dot format

Unassigned Sebastian Nagel Minor Resolved Fixed  
Bug TIKA-2418

English ASCII text classified as video/quicktime

Unassigned Christopher Creutzig Major Resolved Fixed  
Improvement TIKA-2416

Upgrade dependencies in tika-eval

Unassigned Tim Allison Trivial Resolved Fixed  
Improvement TIKA-2414

Upgrade gson to 2.8.1

Unassigned Tim Allison Trivial Resolved Fixed  
Improvement TIKA-2413

Upgrade mime4j to 0.8.1

Unassigned Tim Allison Major Resolved Fixed  
Improvement TIKA-2411

Clean up tika-bundle

Unassigned Tim Allison Trivial Resolved Fixed  
Bug TIKA-2410

RTF parser is tagging non-bold text as bold

Unassigned Dave Kincaid Major Resolved Fixed  
Bug TIKA-2408

ZipException in text extraction from DOCX file

Unassigned Jorge Spinsanti Major Resolved Workaround  
Bug TIKA-2405

SAXParseException in text extraction from DOCX file

Unassigned Jorge Spinsanti Major Resolved Workaround  
Bug TIKA-2404

XMLException in DOCX->TXT conversion

Unassigned Jorge Spinsanti Major Resolved Workaround  
Bug TIKA-2399

Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

Tim Allison Tim Allison Major Resolved Fixed  
Improvement TIKA-2391

Extract <script> elements in html as "attachment" type MACRO like we do in the PDFParser

Tim Allison Tim Allison Minor Resolved Fixed  
Bug TIKA-2388

Problem in Tika().detect for ODB (Open Office database) files

Unassigned Alessandro Scaldaferro Critical Resolved Fixed  
Improvement TIKA-2386

Improve digest options

Unassigned Tim Allison Major Resolved Fixed  
Bug TIKA-2384

Double close of InputStream in accept text/plain in tika-server

Unassigned Tim Allison Blocker Resolved Fixed  
Improvement TIKA-2380

Upgrade to Jackcess 2.1.8 when available

Unassigned Tim Allison Major Resolved Fixed  
Bug TIKA-2379

tika-bundle 1.15 has wrong import of org.sfl4j.event package which does not exists

Bob Paulin Claus Ibsen Blocker Resolved Fixed  
Bug TIKA-2378

Error extracting text from application/x-msaccess mime type

Unassigned Steve Reynolds Minor Resolved Fixed  
Sub-task TIKA-2377

TIKA-1804 Remove org.json from TEIParser

Unassigned Tim Allison Blocker Resolved Fixed  
Task TIKA-2376

Avoid org.json dependency

Unassigned Claus Ibsen Blocker Resolved Fixed  
Improvement TIKA-2374

Tika App -z should extract PDF inline images by default

Unassigned Nick Burch Major Reopened Unresolved  
Improvement TIKA-2341

Upgrade to commons compress 1.14 when available

Tim Allison Tim Allison Minor Resolved Fixed  
Improvement TIKA-2336

Upgrade to POI 3.17-beta1 when available

Unassigned Tim Allison Minor Resolved Fixed  
Improvement TIKA-2335

Extract path info from Excel 2013 .xlsx and .xlsb

Unassigned Tim Allison Trivial Resolved Fixed  
Improvement TIKA-2298

To improve object recognition parser so that it may work without external RESTful service setup

Chris A. Mattmann Avtar Singh Major Resolved Fixed  
Improvement TIKA-2254

Provide chart support for MS Office documents

Tim Allison Chris Bamford Minor Resolved Fixed  
Bug TIKA-2201

OutOfMemoryError on a reasonably sized document

Unassigned Seva Alekseyev Major Resolved Workaround  
Bug TIKA-2089

Macros not extracted from ppt files

Tim Allison Tim Allison Minor Resolved Fixed  
Bug TIKA-1945

Powerpoint parser doesn't extract text from diagrams

Tim Allison Nick C Major Resolved Fixed  
Bug TIKA-1804

Tika use no free json.org

Unassigned gil cattaneo Blocker Resolved Fixed  
New Feature TIKA-1106

CLAVIN Integration

Chris A. Mattmann Adam Estrada Minor Resolved Won't Fix