| TIKA-4038 | Fix dependency problem in tika-parsers-standard-package | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4037 | Add detection for os2 bitmap array files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4035 | Enable extraction of file system metadata in FileSystemFetcher | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4034 | Allow configuration of prettyPrint in FileSystemEmitter | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4033 | Improve metadata for incremental updates, take 2 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4032 | Look for embedded file name in the content-type field in .eml files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4030 | Fix new npe in docx object handling | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4029 | commons-csv 1.10 changed IllegalStateException to IOException | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4028 | Add detection for common subtitle format | Unassigned | Thomas Ledoux | Resolved | Fixed |
| TIKA-4027 | Improve metadata for incremental updates | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4025 | Extract frame count from gifs | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4022 | Tika not parsing AVI files | Unassigned | Gregory Lepore | Resolved | Fixed |
| TIKA-4018 | Extract more info from warc files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4017 | Add optional detection and parsing of incremental updates in PDF | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4016 | Upgrade to PDFBox 2.0.28 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4014 | Bump bind exception retry in tika-server to > 5 seconds and fix trivial equality bug | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4013 | Extract rendition information from epub files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4012 | Improve extraction of embedded documents in PDFs | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4011 | Add detection for ONIXMessage | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-4009 | GeoTopic Parser package changed incorrectly from o.a.t.parser.geo from o.a.t.parser.geo.topic | Chris Mattmann | Chris Mattmann | Resolved | Fixed |
| TIKA-4008 | Change jdk8 job in Jenkins | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3994 | TIKA-3992 Improve audio/mpeg detection | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3993 | Improve throttle logic in S3Fetcher | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3991 | Improve file detection for canon-raw (crw), cr2 and cr3 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3990 | Close pkg for regular InputStreams in OOXMLExtractorFactory | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3987 | Add a parser for ActiveMime | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3986 | JDBCEmitter should strip \u0000 for postgres varchar/strings | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3980 | Fix ossindex fail(s) | Tilman Hausherr | Tilman Hausherr | Resolved | Fixed |
| TIKA-3976 | Allow users to configure behavior for zero-byte files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3971 | Distinguish eps-based Adobe Illustrator files from pdf-based Illustrator files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3968 | Reconstruct embedded file names from associated emf files within docx files | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-3963 | HTML author isn't mapped to its dc:creator counterpart | Unassigned | Josh Burchard | Resolved | Fixed |
| TIKA-3960 | PGP encrypted files get detected as application/octet-stream | Unassigned | Tayseer Sabha | Resolved | Fixed |
| TIKA-2689 | *.ai type (Adobe Illustrator) files are not detected correctly. | Unassigned | Amit Pandey | Resolved | Fixed |