|
|
TIKA-1506
|
OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1503
|
TestGDALParser fails if gdalinfo does not support FITS
|
Tyler Bui-Palsulich
|
Sebastian Nagel
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1502
|
Mime magic for database file formats
|
Unassigned
|
Nick Burch
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1500
|
FeedParser extracts XML markup with BodyContentHandler
|
Tyler Bui-Palsulich
|
Reinhard Pötz
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1496
|
Upgrade slf4j-log4j12 to version 1.7.7
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Done
|
|
|
|
|
|
|
TIKA-1494
|
JAXRS server: allow passing PDF password in the request
|
Unassigned
|
Peter Bowyer
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1491
|
Identification of BPG (Better Portable Graphics) format
|
Unassigned
|
Johan van der Knijff
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1490
|
Basic parser for old Excel files (eg Excel 4)
|
Nick Burch
|
Nick Burch
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1487
|
Add mime for pre-OLE2 xls file
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1477
|
Add custom header processing to allow overriding of OCR and PDF configuration to be used in Tika Server
|
Dave Meikle
|
Dave Meikle
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1476
|
Allow TesseractOCRParser to be configured using an external configuration file
|
Dave Meikle
|
Dave Meikle
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
TIKA-1475
|
Reformat pom.xml files
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Done
|
|
|
|
|
|
|
TIKA-1472
|
Warning on Tika Server startup - Failed to load class "org.slf4j.impl.StaticLoggerBinder"
|
Chris A. Mattmann
|
Darya Arbuzova
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1470
|
Error installing Tika
|
Tyler Bui-Palsulich
|
Darya Arbuzova
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1469
|
Upgrade to POI 3.11-beta3 when available
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1461
|
Bad mime detection of certain JAR file
|
Unassigned
|
Tamas Cservenak
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1459
|
Fix write limit bug in BasicContentHandlerFactory for BodyContentHandler
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1451
|
Add Recursive Metadata Parser Wrapper output to tika-app and gui
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1450
|
Tika does not detect the correct mime-type for webp images
|
Unassigned
|
Nelson Monterroso
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1448
|
CHM parser : defect in file extraction
|
Unassigned
|
Bin Hawking
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1447
|
CHM parser: wrong directory list
|
Unassigned
|
Bin Hawking
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1446
|
CHM parser : wrong decompression of aligned blocks
|
Unassigned
|
Bin Hawking
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1445
|
Figure out how to add Image metadata extraction to Tesseract parser
|
Chris A. Mattmann
|
Chris A. Mattmann
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1444
|
Detection for VirtualPC VHD files
|
Unassigned
|
Luís Filipe Nassif
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1442
|
Upgrade to PDFBox 1.8.8
|
Tim Allison
|
Tim Allison
|
|
Closed |
Fixed
|
|
|
|
|
|
|
TIKA-1441
|
ExternalParsers should allow dynamic keys to be specified for Regexs
|
Chris A. Mattmann
|
Chris A. Mattmann
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1438
|
PhoneExtractingContentHandler to not add individual MD entries for individual phone numbers
|
Lewis John McGibbney
|
Lewis John McGibbney
|
|
Closed |
Not A Problem
|
|
|
|
|
|
|
TIKA-1430
|
CHM parser gets faulty text (fix found)
|
Unassigned
|
Bin Hawking
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1422
|
org.apache.tika.parser.mail.RFC822ParserTest fails
|
Chris A. Mattmann
|
Chris A. Mattmann
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1421
|
Tika-Parsers tests fail on CentOS6 if tesseract isn't installed
|
Chris A. Mattmann
|
Chris A. Mattmann
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1420
|
Add Metadata Extraction to Arbitrary Parsers
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1418
|
Add TikaConfigDumperExample to example package
|
Unassigned
|
Tim Allison
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1412
|
NPE in OpenDocumentParser
|
Unassigned
|
Andrzej Bialecki
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1411
|
Temporary 7z file leak
|
Unassigned
|
Luís Filipe Nassif
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1410
|
Temporary OLE File Leak
|
Unassigned
|
Luís Filipe Nassif
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1404
|
tika-app server leaking temporary files when converting Word97 (doc)
|
Nick Burch
|
Lukas Graf
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1399
|
[Patch] add support for AxCrypt file type detection
|
Unassigned
|
Florent Angebault
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1394
|
TIKA-1390
Create RecursiveMetadata example
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Closed |
Duplicate
|
|
|
|
|
|
|
TIKA-1393
|
TIKA-1390
Create Translator example
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Closed |
Fixed
|
|
|
|
|
|
|
TIKA-1392
|
TIKA-1390
Create a LanguageIdentifier example
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Closed |
Fixed
|
|
|
|
|
|
|
TIKA-1391
|
TIKA-1390
Create Parser.parse() example
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1389
|
Convert all wildcard imports to explicit imports
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1387
|
Add forbidden-apis checker to TIKA build
|
Tyler Bui-Palsulich
|
Uwe Schindler
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1385
|
Create an External Translator
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1384
|
Use tika-parent dependency management for common dependencies
|
Tyler Bui-Palsulich
|
Tyler Bui-Palsulich
|
|
Resolved |
Done
|
|
|
|
|
|
|
TIKA-1380
|
Upgrade to Apache POI 3.11 beta 1
|
Unassigned
|
Nick Burch
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1377
|
As a user, I would like to see "album artist", "disc number", and "compilation" in parsed MP3 and MP4 types
|
Unassigned
|
Daniel O. Becker
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1371
|
passing parameters via URL no longer works (regression)
|
Unassigned
|
Rob Tulloh
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1369
|
Date parsing and thread safety in ImageMetadataExtractor
|
Chris A. Mattmann
|
John Gibson
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1355
|
Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser
|
Unassigned
|
Sven Krüger
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1354
|
ForkParser doesn't work in OSGI container
|
Unassigned
|
Michal Hlavac
|
|
Closed |
Fixed
|
|
|
|
|
|
|
TIKA-1289
|
Ligatures convert on text extraction
|
Unassigned
|
Alex Andrushchak
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1246
|
Include LastModifiedDate in metadata of archive entries
|
Unassigned
|
Luís Filipe Nassif
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1242
|
Update CXF version to 3.0.2
|
Sergey Beryozkin
|
Sergey Beryozkin
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1218
|
Unable to parse a mp3 file on 1.5 getting a exception
|
Tyler Bui-Palsulich
|
Sumeet Gorab
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1167
|
Embedded object not extracted
|
Tyler Bui-Palsulich
|
Daniel Bonniot de Ruisselet
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-1118
|
OOXML parser throws when relationship points to 0 byte embedded part
|
Unassigned
|
Lee Graber
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-672
|
Proper error handling in the CHM parser
|
Unassigned
|
Jukka Zitting
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-605
|
Tika GDAL parser
|
Chris A. Mattmann
|
Chris A. Mattmann
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-595
|
HtmlHandler does not support multivalue metadata
|
Dave Meikle
|
Lutz Pumpenmeier
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
TIKA-93
|
OCR support
|
Chris A. Mattmann
|
Jukka Zitting
|
|
Resolved |
Fixed
|
|
|
|
|