|
|
TIKA-3029
|
to extract information from ppt formats along with tables and image content
|
Unassigned
|
aashika
|
|
Open |
Unresolved
|
|
|
|
|
|
|
TIKA-2917
|
Extract metadata from inline images in PDFs
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2909
|
Contributing HWP v5 Parser
|
Unassigned
|
soomyung
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2908
|
TikaException: Failed to close temporary resource - how to fix?
|
Tim Allison
|
Marichi Gupta
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2907
|
Blanket dependency upgrades for next release cycle (1.22)
|
Unassigned
|
Tim Allison
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
TIKA-2905
|
Allow users to skip list markup in RTF
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2903
|
RereadableInputStream does not close storeOutputStream in all casses in, temporary files remain locked
|
Unassigned
|
Peter Fassev
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2899
|
org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.rtf.RTFParser@375a26af
|
Tim Allison
|
Pandurang
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2898
|
wrong email send date being set in OutlookPSTParser
|
Tim Allison
|
Paul Woods
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2896
|
NullPointerException in MimeTypesReader.releaseParser()
|
Unassigned
|
Eamonn Saunders
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2895
|
duplicate entry for application/x-gtar mime signature
|
Tim Allison
|
Richard Lehane
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2893
|
Upgrade to PDFBox 2.0.16 when available
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2887
|
Build fails with CVE warnings
|
Unassigned
|
T Craig
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
TIKA-2886
|
StreamingZipContainerDetector fails on XLSX template workbook
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2885
|
Vulnerabilities in tika
|
Unassigned
|
Kevin Ng
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2883
|
Text not extracted from RTF files
|
Tim Allison
|
Luís Filipe Nassif
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2879
|
Download page gpg example needs second parameter
|
Unassigned
|
Sebb
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2876
|
Configure tesseract/PDF configs in parseContext in UnpackerResource via httpheaders
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2790
|
Consider switching lang-detection in tika-eval to open-nlp
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2500
|
Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.
|
Tim Allison
|
Rohit Sureshrao Shelhalkar
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2150
|
RTF TextExtractor omits some content
|
Tim Allison
|
T. Schmidt
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1731
|
Try to integrate java-hwp into Tika
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1728
|
Detection is not working properly for detecting HWP 5.0 file
|
Unassigned
|
mungeol heo
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1713
|
RTF parser misses text content
|
Tim Allison
|
Mike Cantrell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1568
|
AutoDetectReader performance problem
|
Tim Allison
|
Andrzej Bialecki
|
|
Resolved |
Fixed
|
|
|
|
|