Sub-task
- [NUTCH-1126] - JUnit test for urlfilter-prefix
Bug
- [NUTCH-1475] - Index-More Plugin -- A better fall back value for date field
- [NUTCH-1571] - SolrInputSplit doesn't implement Writable and crawl script doesn't pass crawlId to generate and updatedb tasks
- [NUTCH-1591] - Incorrect conversion of ByteBuffer to String
Improvement
- [NUTCH-1420] - Get rid of the dreaded �
- [NUTCH-1585] - Ensure duplicate tags do not exist in microformat-reltag tag set.
- [NUTCH-1649] - Sentence Detection plugin
Task
- [NUTCH-1522] - Upgrade to Tika 1.3
- [NUTCH-1578] - Upgrade to Hadoop 1.2.0
Edit/Copy Release Notes
The text area below allows the project release notes to be edited and copied to another document.