ASF JIRA

Tika
1.7
Key descending
161 of 61 as at: 19/Apr/24 12:27
T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Bug TIKA-1506

OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

Unassigned Tim Allison Blocker Resolved Fixed  
Bug TIKA-1503

TestGDALParser fails if gdalinfo does not support FITS

Tyler Bui-Palsulich Sebastian Nagel Minor Resolved Fixed  
Improvement TIKA-1502

Mime magic for database file formats

Unassigned Nick Burch Major Resolved Fixed  
Bug TIKA-1500

FeedParser extracts XML markup with BodyContentHandler

Tyler Bui-Palsulich Reinhard Pötz Minor Resolved Fixed  
Improvement TIKA-1496

Upgrade slf4j-log4j12 to version 1.7.7

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Resolved Done  
New Feature TIKA-1494

JAXRS server: allow passing PDF password in the request

Unassigned Peter Bowyer Major Resolved Fixed  
Improvement TIKA-1491

Identification of BPG (Better Portable Graphics) format

Unassigned Johan van der Knijff Minor Resolved Fixed  
Improvement TIKA-1490

Basic parser for old Excel files (eg Excel 4)

Nick Burch Nick Burch Major Resolved Fixed  
Improvement TIKA-1487

Add mime for pre-OLE2 xls file

Unassigned Tim Allison Trivial Resolved Fixed  
Bug TIKA-1477

Add custom header processing to allow overriding of OCR and PDF configuration to be used in Tika Server

Dave Meikle Dave Meikle Minor Resolved Fixed  
Improvement TIKA-1476

Allow TesseractOCRParser to be configured using an external configuration file

Dave Meikle Dave Meikle Minor Resolved Implemented  
Task TIKA-1475

Reformat pom.xml files

Tyler Bui-Palsulich Tyler Bui-Palsulich Trivial Resolved Done  
Bug TIKA-1472

Warning on Tika Server startup - Failed to load class "org.slf4j.impl.StaticLoggerBinder"

Chris A. Mattmann Darya Arbuzova Minor Resolved Fixed  
Bug TIKA-1470

Error installing Tika

Tyler Bui-Palsulich Darya Arbuzova Minor Resolved Fixed  
Improvement TIKA-1469

Upgrade to POI 3.11-beta3 when available

Unassigned Tim Allison Minor Resolved Fixed  
Bug TIKA-1461

Bad mime detection of certain JAR file

Unassigned Tamas Cservenak Major Resolved Fixed  
Bug TIKA-1459

Fix write limit bug in BasicContentHandlerFactory for BodyContentHandler

Unassigned Tim Allison Trivial Resolved Fixed  
Improvement TIKA-1451

Add Recursive Metadata Parser Wrapper output to tika-app and gui

Unassigned Tim Allison Minor Resolved Fixed  
Bug TIKA-1450

Tika does not detect the correct mime-type for webp images

Unassigned Nelson Monterroso Minor Resolved Fixed  
Bug TIKA-1448

CHM parser : defect in file extraction

Unassigned Bin Hawking Major Resolved Fixed  
Bug TIKA-1447

CHM parser: wrong directory list

Unassigned Bin Hawking Critical Resolved Fixed  
Bug TIKA-1446

CHM parser : wrong decompression of aligned blocks

Unassigned Bin Hawking Critical Resolved Fixed  
Bug TIKA-1445

Figure out how to add Image metadata extraction to Tesseract parser

Chris A. Mattmann Chris A. Mattmann Blocker Resolved Fixed  
Improvement TIKA-1444

Detection for VirtualPC VHD files

Unassigned Luís Filipe Nassif Minor Resolved Fixed  
Improvement TIKA-1442

Upgrade to PDFBox 1.8.8

Tim Allison Tim Allison Major Closed Fixed  
Bug TIKA-1441

ExternalParsers should allow dynamic keys to be specified for Regexs

Chris A. Mattmann Chris A. Mattmann Major Resolved Fixed  
Bug TIKA-1438

PhoneExtractingContentHandler to not add individual MD entries for individual phone numbers

Lewis John McGibbney Lewis John McGibbney Minor Closed Not A Problem  
Bug TIKA-1430

CHM parser gets faulty text (fix found)

Unassigned Bin Hawking Critical Resolved Fixed  
Bug TIKA-1422

org.apache.tika.parser.mail.RFC822ParserTest fails

Chris A. Mattmann Chris A. Mattmann Major Resolved Fixed  
Bug TIKA-1421

Tika-Parsers tests fail on CentOS6 if tesseract isn't installed

Chris A. Mattmann Chris A. Mattmann Blocker Resolved Fixed  
Improvement TIKA-1420

Add Metadata Extraction to Arbitrary Parsers

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Resolved Fixed  
New Feature TIKA-1418

Add TikaConfigDumperExample to example package

Unassigned Tim Allison Trivial Resolved Fixed  
Bug TIKA-1412

NPE in OpenDocumentParser

Unassigned Andrzej Bialecki Major Resolved Fixed  
Bug TIKA-1411

Temporary 7z file leak

Unassigned Luís Filipe Nassif Major Resolved Fixed  
Bug TIKA-1410

Temporary OLE File Leak

Unassigned Luís Filipe Nassif Major Resolved Fixed  
Bug TIKA-1404

tika-app server leaking temporary files when converting Word97 (doc)

Nick Burch Lukas Graf Major Resolved Fixed  
Improvement TIKA-1399

[Patch] add support for AxCrypt file type detection

Unassigned Florent Angebault Major Resolved Fixed  
Sub-task TIKA-1394

TIKA-1390 Create RecursiveMetadata example

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Closed Duplicate  
Sub-task TIKA-1393

TIKA-1390 Create Translator example

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Closed Fixed  
Sub-task TIKA-1392

TIKA-1390 Create a LanguageIdentifier example

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Closed Fixed  
Sub-task TIKA-1391

TIKA-1390 Create Parser.parse() example

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Resolved Fixed  
Bug TIKA-1389

Convert all wildcard imports to explicit imports

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Resolved Fixed  
Improvement TIKA-1387

Add forbidden-apis checker to TIKA build

Tyler Bui-Palsulich Uwe Schindler Major Resolved Fixed  
Bug TIKA-1385

Create an External Translator

Tyler Bui-Palsulich Tyler Bui-Palsulich Major Resolved Fixed  
Improvement TIKA-1384

Use tika-parent dependency management for common dependencies

Tyler Bui-Palsulich Tyler Bui-Palsulich Minor Resolved Done  
Improvement TIKA-1380

Upgrade to Apache POI 3.11 beta 1

Unassigned Nick Burch Major Resolved Fixed  
Improvement TIKA-1377

As a user, I would like to see "album artist", "disc number", and "compilation" in parsed MP3 and MP4 types

Unassigned Daniel O. Becker Minor Resolved Fixed  
Bug TIKA-1371

passing parameters via URL no longer works (regression)

Unassigned Rob Tulloh Major Resolved Fixed  
Bug TIKA-1369

Date parsing and thread safety in ImageMetadataExtractor

Chris A. Mattmann John Gibson Critical Resolved Fixed  
Bug TIKA-1355

Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

Unassigned Sven Krüger Minor Resolved Fixed  
Bug TIKA-1354

ForkParser doesn't work in OSGI container

Unassigned Michal Hlavac Major Closed Fixed  
Bug TIKA-1289

Ligatures convert on text extraction

Unassigned Alex Andrushchak Major Resolved Fixed  
Improvement TIKA-1246

Include LastModifiedDate in metadata of archive entries

Unassigned Luís Filipe Nassif Minor Resolved Fixed  
Task TIKA-1242

Update CXF version to 3.0.2

Sergey Beryozkin Sergey Beryozkin Minor Resolved Fixed  
Bug TIKA-1218

Unable to parse a mp3 file on 1.5 getting a exception

Tyler Bui-Palsulich Sumeet Gorab Blocker Resolved Fixed  
Bug TIKA-1167

Embedded object not extracted

Tyler Bui-Palsulich Daniel Bonniot de Ruisselet Critical Resolved Fixed  
Bug TIKA-1118

OOXML parser throws when relationship points to 0 byte embedded part

Unassigned Lee Graber Major Resolved Fixed  
Bug TIKA-672

Proper error handling in the CHM parser

Unassigned Jukka Zitting Minor Resolved Fixed  
New Feature TIKA-605

Tika GDAL parser

Chris A. Mattmann Chris A. Mattmann Major Resolved Fixed  
Bug TIKA-595

HtmlHandler does not support multivalue metadata

Dave Meikle Lutz Pumpenmeier Minor Resolved Fixed  
New Feature TIKA-93

OCR support

Chris A. Mattmann Jukka Zitting Minor Resolved Fixed