ASF JIRA

Tika
1.25
Key descending
143 of 43 as at: 25/Apr/24 15:49
T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Task TIKA-3236

Upgrade cxf-core to 3.3.8

Tim Allison Jesper HÃ¥steen Minor Resolved Fixed  
Bug TIKA-3232

security vulnerability in dependencies

Tim Allison Shayne Grant Major Resolved Fixed  
Task TIKA-3230

Upgrade junit and turn off ossindex warning

Tim Allison Tim Allison Trivial Resolved Fixed  
New Feature TIKA-3228

Add file name and extension to FileProfiler

Unassigned Tim Allison Minor Resolved Fixed  
Task TIKA-3217

Extract metadata from XMPPDFSchema in PDFs' XMP

Tim Allison Tim Allison Trivial Resolved Fixed  
Task TIKA-3216

Add FileProfiler to tika-eval

Tim Allison Tim Allison Major Resolved Fixed  
Task TIKA-3215

Add a detector that calls the 'file' command

Unassigned Tim Allison Minor Resolved Fixed  
Improvement TIKA-3210

tika status endpoint should have a Node UUID

Unassigned Nicholas DiPiazza Minor Resolved Fixed  
Bug TIKA-3208

tika-server Detect when using fileUrl header does not close the file handle

Unassigned Darren Cooper Major Resolved Fixed  
Bug TIKA-3207

Invalid language code in TesseractOCRConfig

Tim Allison Daniel Smyda Minor Resolved Fixed  
Improvement TIKA-3204

License incompliance with xmp-core 6.1.10

Unassigned Christian Seipel Blocker Resolved Fixed  
Bug TIKA-3203

MP4Parser temporary files are not deleted from Tomcat temp folder

Unassigned Isabelle Giguere Major Resolved Fixed  
Task TIKA-3193

Add mime detection for avif

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-3191

Issue with GrobidJournalParser

Dave Meikle Nav Major Resolved Fixed  
Task TIKA-3188

Add IDML Parser

Dave Meikle Dave Meikle Major Resolved Implemented  
Bug TIKA-3170

PDF extraction space issue

Unassigned Akash Major Closed Duplicate  
Bug TIKA-3159

Macros not extracted from OpenDocument format Office files (flatXML format)

Tim Allison Robert Kaulbach Minor Resolved Fixed  
Bug TIKA-3158

Macros not extracted from OpenDocument format Office files (zip format)

Tim Allison Robert Kaulbach Minor Resolved Fixed  
Bug TIKA-3156

Missing content from .odt file with hyperlinked image

Dave Meikle Robert Kaulbach Minor Resolved Fixed  
Bug TIKA-3153

Text File identified as message/rfc822

Unassigned Akash Major Resolved Fixed  
Task TIKA-3147

Strip punctuation in lang id component within tika-eval

Unassigned Tim Allison Major Resolved Fixed  
Task TIKA-3146

Add Nutch's TextProfileSignature digest to tika-eval's text stats

Tim Allison Tim Allison Major Resolved Fixed  
Task TIKA-3145

Add a content digester to tika-eval text stats

Tim Allison Tim Allison Major Resolved Fixed  
Task TIKA-3140

Add a metadata filter for tika-eval

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-3138

PDF parser with XFA produce malformed XML

Tim Allison wiwi Major Resolved Fixed  
Task TIKA-3137

Enable a metadata filter for the RecursiveParserWrapper

Tim Allison Tim Allison Major Resolved Fixed  
Task TIKA-3135

No need to spool file for HeifParser

Unassigned Tim Allison Major Resolved Fixed  
Improvement TIKA-3133

/rmeta endpoint should not hard code writeLimit and maxEmbeddedResources

Tim Allison Nicholas DiPiazza Trivial Resolved Fixed  
Bug TIKA-3131

PDFParserConfig default values were accidentally swapped

Unassigned Clark Perkins Major Resolved Fixed  
Wish TIKA-3126

Consider new endpoint (metadata + content non recursive)

Unassigned Carina Antunes Trivial Resolved Fixed  
Task TIKA-3122

Extract inline image metadata without rendering for PDFs

Unassigned Tim Allison Minor Resolved Fixed  
Task TIKA-3120

Remove whitelist/blacklist terminology

Unassigned Tim Allison Major Resolved Fixed  
Task TIKA-3117

Upgrade to metadata-extractor 2.14.0

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-3112

NullPointerException at AbstractPDF2XHTML.extractXMPXFA() when using tika-app GUI

Tim Allison Ip Smile Major Resolved Fixed  
Task TIKA-3101

Include XMPSchemaBasic metadata in xmp metadata extraction

Unassigned Tim Allison Major Resolved Fixed  
Bug TIKA-3095

tika-bundle tests fail on windows due to missing jcip-annotations

Bob Paulin Bob Paulin Minor Resolved Fixed  
Bug TIKA-3094

Apache Tika fails to extract text for pptx extension.

Bob Paulin Abhishek Chauhan Critical Resolved Fixed  
Task TIKA-3083

Consider adding a fuzzing module

Tim Allison Tim Allison Major Resolved Fixed  
Task TIKA-3078

Enable standard configuration of GeoParser

Tim Allison Tim Allison Major Resolved Fixed  
Task TIKA-3071

tika-server's unpacker should pass the parent parser into the parsecontext to be used for inline parsing

Tim Allison Tim Allison Major Resolved Fixed  
Bug TIKA-3009

XML Parser reset() detection no working in weblogic 12.2.1.3

Tim Allison Daniel Critical Resolved Fixed  
Improvement TIKA-2888

Add wmv2 codec detection to ASF container

Unassigned David Avendasora Major Resolved Fixed  
Bug TIKA-2443

Plain text file identified as rfc822 and which can cause StackOverflowError

Unassigned Viorica Visan Major Resolved Fixed