| Key | Summary | Assignee | Reporter | Status | Resolution |
|---|---|---|---|---|---|
| TIKA-3198 | extracting ppt with chart give excel in which data is missing | Unassigned | sagar | Open | Unresolved |
| TIKA-2725 | Make tika-server robust against ooms/infinite loops/memory leaks | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2721 | Exclude Spring (transitive dependency) from tika-parsers | Konstantin Gribov | Konstantin Gribov | Closed | Fixed |
| TIKA-2716 | Sonatype Nexus auditor is reporting that spring framework version used by Tika 1.18 is vulnerable | Konstantin Gribov | Abhijit Rajwade | Closed | Won't Fix |
| TIKA-2707 | Upgrade to commons-compress 1.18 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2706 | Store exceptions from VBAMacroReader as we do other embedded exceptions | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2705 | Allow configuration of TesseractOCRParser as we do for other parsers | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2704 | MPEGStream should throw an EOF if appropriate in skipFrame | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2703 | Error indexing a xlsx file | Tim Allison | Mario Bisonti | Resolved | Fixed |
| TIKA-2699 | Security: Sonatype Nexus scan is reporting multiple vulnerabilities on the bouncy castle version used by Apache Tika | Unassigned | Abhijit Rajwade | Resolved | Duplicate |
| TIKA-2695 | Upgrade Lucene in tika-eval and tika-example | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2693 | Tika 1.17 uses the wrong classloader for reflection | Unassigned | Karl Wright | Resolved | Fixed |
| TIKA-2692 | Blanket upgrades in prep for 1.19 | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2691 | Can't create a RPM | Tim Allison | Celpan Valeria | Resolved | Fixed |
| TIKA-2690 | Exclude commons-logging & commons-logging-api from uimafit-core | Unassigned | Hans Brende | Resolved | Fixed |
| TIKA-2688 | MBOX not recognized when unknown X-headers are present | Tim Allison | Yury Kats | Resolved | Fixed |
| TIKA-2687 | Avoid potential to overwrite attachments | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2686 | pdfbox fontbox 2.0.8 has security vulnerability CVE-2018-8036 and should be upgraded to 2.0.11 | Unassigned | Abhijit Rajwade | Resolved | Duplicate |
| TIKA-2683 | Missing space and inappropriate new-line in Boilerpipe extracted text | Kenneth William Krugler | Karanjeet Singh | Resolved | Fixed |
| TIKA-2682 | Upgrade jempbox to 1.8.15 | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2681 | Upgrade to PDFBox 2.0.11 | Konstantin Gribov | Konstantin Gribov | Closed | Fixed |
| TIKA-2679 | Bump 1.x branch to Java 1.8 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2677 | ConcurrentModificationException in org.apache.tika.mime.MediaTypeRegistry.getAliases | Tim Allison | Yuriy Koval | Resolved | Fixed |
| TIKA-2675 | OpenDocumentParser should fail on invalid zip files | Tim Allison | Sebastian Nagel | Resolved | Fixed |
| TIKA-2673 | HtmlEncodingDetector doesn't follow the specification | Tim Allison | Gerard Bouchar | Resolved | Fixed |
| TIKA-2672 | Upgrade dl4j to 1.0.0-beta2 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2669 | Tika JAX-RS PDF parser option / custom config issue | Tim Allison | Annie Didier | Resolved | Fixed |
| TIKA-2668 | Fix 'can't overwrite cause' exception in TaggedSAXException in Java 11-ea | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2667 | Upgrade jmatio to 1.4 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2664 | Upgrade junrar to 1.0.1 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2662 | Add a streaming out option for the Json serialization | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2661 | Upgrade commons-compress to 1.17 | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2657 | Add System.exit() and heavy gc hang to MockParser | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2656 | Allow users to specify timeout for parsing and/or waiting in ForkParser | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2655 | Allow the RecursiveParserWrapper to work with the ForkParser | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2653 | Allow users to specify a directory of jars for classloading in ForkParser | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2647 | Create a "security" page on our website | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2645 | Reuse SAXParsers where possible | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2644 | Improve RecursiveParserWrapper API | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2636 | ENVI Header metadata fields can span more than one line | Lewis John McGibbney | Lewis John McGibbney | Resolved | Fixed |
| TIKA-2629 | Add image/x-dpx media-type detection | Unassigned | Andreas Meier | Resolved | Fixed |
| TIKA-2628 | Add image/aces media-type detection | Unassigned | Andreas Meier | Resolved | Fixed |
| TIKA-2577 | Sonatype Nexus Auditor is reporting that the Bouncy castle version used by Tika 1.17 is vulnerable | Unassigned | Abhijit Rajwade | Resolved | Fixed |
| TIKA-2552 | Upgrade to POI 4.0.0 when available | Tim Allison | Tim Allison | Resolved | Fixed |
| TIKA-2549 | NoSuchMethodException "CTPictureBaseImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" parsing certain .docx files | Unassigned | Adam Rauch | Resolved | Fixed |
| TIKA-2520 | OptimaizeLangDetector#loadModels() should not be called for every single langdetect HTTP request | Chris A. Mattmann | Vincent van Donselaar | Resolved | Fixed |
| TIKA-2479 | Handle empty cells in tables uniformly | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2462 | Add a parser for sas7bdat | Unassigned | Tim Allison | Resolved | Fixed |
| TIKA-2446 | Tainted Zip file can provoke OOM errors | Unassigned | Thorsten Schäfer | Resolved | Fixed |
| TIKA-2100 | Html Parser does not keep the html tag attributes | Unassigned | Gerard Bouchar | Resolved | Fixed |
| TIKA-1675 | please avoid xmlbeans dependency | Unassigned | Robert Muir | Resolved | Fixed |