ASF JIRA

Tika
1.22
Key descending
125 of 25 as at: 28/Mar/24 18:11
T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Improvement TIKA-3029

to extract information from ppt formats along with tables and image content

Unassigned aashika Major Open Unresolved  
Improvement TIKA-2917

Extract metadata from inline images in PDFs

Tim Allison Tim Allison Minor Resolved Fixed  
New Feature TIKA-2909

Contributing HWP v5 Parser

Unassigned soomyung Major Resolved Fixed  
Bug TIKA-2908

TikaException: Failed to close temporary resource - how to fix?

Tim Allison Marichi Gupta Blocker Resolved Fixed  
Task TIKA-2907

Blanket dependency upgrades for next release cycle (1.22)

Unassigned Tim Allison Major Resolved Duplicate  
Task TIKA-2905

Allow users to skip list markup in RTF

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-2903

RereadableInputStream does not close storeOutputStream in all casses in, temporary files remain locked

Unassigned Peter Fassev Critical Resolved Fixed  
Bug TIKA-2899

org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.rtf.RTFParser@375a26af

Tim Allison Pandurang Critical Resolved Fixed  
Bug TIKA-2898

wrong email send date being set in OutlookPSTParser

Tim Allison Paul Woods Major Resolved Fixed  
Bug TIKA-2896

NullPointerException in MimeTypesReader.releaseParser()

Unassigned Eamonn Saunders Major Resolved Fixed  
Bug TIKA-2895

duplicate entry for application/x-gtar mime signature

Tim Allison Richard Lehane Minor Resolved Fixed  
Task TIKA-2893

Upgrade to PDFBox 2.0.16 when available

Tim Allison Tim Allison Minor Resolved Fixed  
Bug TIKA-2887

Build fails with CVE warnings

Unassigned T Craig Major Resolved Duplicate  
Task TIKA-2886

StreamingZipContainerDetector fails on XLSX template workbook

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-2885

Vulnerabilities in tika

Unassigned Kevin Ng Blocker Resolved Fixed  
Bug TIKA-2883

Text not extracted from RTF files

Tim Allison Luís Filipe Nassif Major Resolved Fixed  
Bug TIKA-2879

Download page gpg example needs second parameter

Unassigned Sebb Major Resolved Fixed  
Task TIKA-2876

Configure tesseract/PDF configs in parseContext in UnpackerResource via httpheaders

Tim Allison Tim Allison Trivial Resolved Fixed  
Improvement TIKA-2790

Consider switching lang-detection in tika-eval to open-nlp

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-2500

Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

Tim Allison Rohit Sureshrao Shelhalkar Major Resolved Fixed  
Bug TIKA-2150

RTF TextExtractor omits some content

Tim Allison T. Schmidt Major Resolved Fixed  
New Feature TIKA-1731

Try to integrate java-hwp into Tika

Unassigned Tim Allison Minor Resolved Fixed  
Bug TIKA-1728

Detection is not working properly for detecting HWP 5.0 file

Unassigned mungeol heo Major Resolved Fixed  
Bug TIKA-1713

RTF parser misses text content

Tim Allison Mike Cantrell Major Resolved Fixed  
Bug TIKA-1568

AutoDetectReader performance problem

Tim Allison Andrzej Bialecki Major Resolved Fixed