|
|
TIKA-3068
|
Fix release configuration for tika-server as a service
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3063
|
Tika parser / POI crash with IndexOutOfBoundsException error
|
Unassigned
|
MRIT64
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3060
|
Unpack file .ppt leads to TikaException
|
Tim Allison
|
Carina Antunes
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3057
|
Improve detection of zip-based formats
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3055
|
Add an optional PreflightPDFParser
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3054
|
[Dependency] Cross-site Scripting (XSS) in org.apache.cxf:cxf-rt-transports-http 3.3.2
|
Tim Allison
|
Michael Moritz
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3052
|
[Dependency] Unsafe Dependancy Resolution in com.beust:jcommander 1.35
|
Tim Allison
|
Michael Moritz
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3050
|
Add xmp extraction to psd files
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3047
|
Upgrade to POI 4.1.2
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3045
|
Allow users to run custom parsing of xfa and xmp
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3042
|
Date format extraction problem in XLS/XLSX
|
Tim Allison
|
Zoltan Farago
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3041
|
ExtractInlineImages missing images from PDFBOX-52
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3040
|
PDF inline OCR: Exception while processing certain image (others in same PDF work)
|
Unassigned
|
Markus Mandalka
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3039
|
Remove mvn dockerfile:build goal from tika-server
|
Unassigned
|
Eric Pugh
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3037
|
Tika Docs should highlight Tika-Server
|
Unassigned
|
Eric Pugh
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3035
|
Tika-app --extract mode outputs to stderr instead of stdout
|
Tim Allison
|
Soren Daugaard
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3033
|
Upgrade to PDFBox 2.0.19 when available
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3031
|
NumberFormatException while parsing a certain PDF document
|
Unassigned
|
Jan Vlug
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3030
|
XLS files with a root node named WORKBOOK don't get parsed
|
Tim Allison
|
Clark Perkins
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3026
|
Consider extracting structure/tags where possible in PDFs with the PDFMarkedContentExtractor
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3023
|
Text files starting with MOVI are detected as X-SGI-Movie
|
Unassigned
|
Steve
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3021
|
Upgrade to PDFBOX 2.0.18
|
Tim Allison
|
Jorge Spinsanti
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3020
|
Keynote Parser | KeynoteContentHandler - <tr> start & end element handler method being called incorrectly
|
Unassigned
|
Syed Osama Anwer
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3017
|
OOM in XSLFSheet.java
|
Tim Allison
|
Don
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3016
|
Old Excel Parser fails with ToXMLHandler
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3015
|
TNEFParser fails with ToXMLContentHandler
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3013
|
TSDParser should pass wrapped handler into handle attachments
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3012
|
NPE caused by multiple calls to start/end document in RFC822Parser
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3011
|
Need to add release version for maven-compiler-plugin
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3010
|
Tika needs service installation script
|
Unassigned
|
Eric Pugh
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3006
|
Regression in PDF keywords extraction since 1.23
|
Tim Allison
|
David Pilato
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2997
|
Add embedded depth as a metadata field populated by RecursiveParserWrapperHandler
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2992
|
java.lang.UnsupportedOperationException: This feature requires ASM7 in Tika 1.21
|
Unassigned
|
Arvind Jain
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2964
|
Upgrade Jackson Databind dependency to 2.9.10.1 or 2.10.0 to fix latest CVEs
|
Unassigned
|
Alex Ott
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2952
|
Vulnerable "metadata-extractor 2.11.0" is present in tika 1.22.
|
Tim Allison
|
Aman Mishra
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2546
|
com.pff:java-libpst is branch EOL
|
Luís Filipe Nassif
|
Richard Jones
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2224
|
OneNote formats support - Mime Magic and Parser
|
Tim Allison
|
Nick Burch
|
|
Resolved |
Fixed
|
|
|
|
|