|
|
TIKA-3236
|
Upgrade cxf-core to 3.3.8
|
Tim Allison
|
Jesper HÃ¥steen
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3232
|
security vulnerability in dependencies
|
Tim Allison
|
Shayne Grant
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3230
|
Upgrade junit and turn off ossindex warning
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3228
|
Add file name and extension to FileProfiler
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3217
|
Extract metadata from XMPPDFSchema in PDFs' XMP
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3216
|
Add FileProfiler to tika-eval
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3215
|
Add a detector that calls the 'file' command
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3210
|
tika status endpoint should have a Node UUID
|
Unassigned
|
Nicholas DiPiazza
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3208
|
tika-server Detect when using fileUrl header does not close the file handle
|
Unassigned
|
Darren Cooper
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3207
|
Invalid language code in TesseractOCRConfig
|
Tim Allison
|
Daniel Smyda
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3204
|
License incompliance with xmp-core 6.1.10
|
Unassigned
|
Christian Seipel
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3203
|
MP4Parser temporary files are not deleted from Tomcat temp folder
|
Unassigned
|
Isabelle Giguere
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3193
|
Add mime detection for avif
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3191
|
Issue with GrobidJournalParser
|
Dave Meikle
|
Nav
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3188
|
Add IDML Parser
|
Dave Meikle
|
Dave Meikle
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
TIKA-3170
|
PDF extraction space issue
|
Unassigned
|
Akash
|
|
Closed |
Duplicate
|
|
|
|
|
|
|
TIKA-3159
|
Macros not extracted from OpenDocument format Office files (flatXML format)
|
Tim Allison
|
Robert Kaulbach
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3158
|
Macros not extracted from OpenDocument format Office files (zip format)
|
Tim Allison
|
Robert Kaulbach
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3156
|
Missing content from .odt file with hyperlinked image
|
Dave Meikle
|
Robert Kaulbach
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3153
|
Text File identified as message/rfc822
|
Unassigned
|
Akash
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3147
|
Strip punctuation in lang id component within tika-eval
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3146
|
Add Nutch's TextProfileSignature digest to tika-eval's text stats
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3145
|
Add a content digester to tika-eval text stats
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3140
|
Add a metadata filter for tika-eval
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3138
|
PDF parser with XFA produce malformed XML
|
Tim Allison
|
wiwi
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3137
|
Enable a metadata filter for the RecursiveParserWrapper
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3135
|
No need to spool file for HeifParser
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3133
|
/rmeta endpoint should not hard code writeLimit and maxEmbeddedResources
|
Tim Allison
|
Nicholas DiPiazza
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3131
|
PDFParserConfig default values were accidentally swapped
|
Unassigned
|
Clark Perkins
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3126
|
Consider new endpoint (metadata + content non recursive)
|
Unassigned
|
Carina Antunes
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3122
|
Extract inline image metadata without rendering for PDFs
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3120
|
Remove whitelist/blacklist terminology
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3117
|
Upgrade to metadata-extractor 2.14.0
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3112
|
NullPointerException at AbstractPDF2XHTML.extractXMPXFA() when using tika-app GUI
|
Tim Allison
|
Ip Smile
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3101
|
Include XMPSchemaBasic metadata in xmp metadata extraction
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3095
|
tika-bundle tests fail on windows due to missing jcip-annotations
|
Bob Paulin
|
Bob Paulin
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3094
|
Apache Tika fails to extract text for pptx extension.
|
Bob Paulin
|
Abhishek Chauhan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3083
|
Consider adding a fuzzing module
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3078
|
Enable standard configuration of GeoParser
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3071
|
tika-server's unpacker should pass the parent parser into the parsecontext to be used for inline parsing
|
Tim Allison
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-3009
|
XML Parser reset() detection no working in weblogic 12.2.1.3
|
Tim Allison
|
Daniel
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2888
|
Add wmv2 codec detection to ASF container
|
Unassigned
|
David Avendasora
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-2443
|
Plain text file identified as rfc822 and which can cause StackOverflowError
|
Unassigned
|
Viorica Visan
|
|
Resolved |
Fixed
|
|
|
|
|