Apache Tika - Development

This forum is an archive for the mailing list tika-dev@lucene.apache.org. Messages posted here will be sent to this mailing list.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Topics (16185)
Topic, Replies, Last Post
Fwd: a 'lite' version of ooxml-schemas jar by Jukka Zitting
0
by Jukka Zitting
[jira] Created: (TIKA-337) SWF parser by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Commented: (TIKA-147) Add Flash parser by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-329) secure-processing not supported by some JAXP implementations (2) by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Commented: (TIKA-147) Add Flash parser by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode by JIRA jira@apache.org
21
by JIRA jira@apache.org
[jira] Created: (TIKA-309) Mime type application/rdf+xml not correctly detected by JIRA jira@apache.org
14
by JIRA jira@apache.org
Missing href attribute handling by kkrugler
0
by kkrugler
[jira] Created: (TIKA-333) Improve accuracy of charset detection for HTML pages by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-331) Windings font recognition in Tika parsing + spacing issue by JIRA jira@apache.org
4
by JIRA jira@apache.org
[ANNOUNCE] Apache Tika 0.5 Released by Mattmann, Chris A (3...
6
by Karl Heinz Marbaise-...
[jira] Created: (TIKA-330) Better HWP (Hangul Word Processor) detection pattern by JIRA jira@apache.org
1
by JIRA jira@apache.org
Build failed in Hudson: Tika-trunk #226 by Apache Hudson Server
10
by Apache Hudson Server
[jira] Created: (TIKA-271) secure-processing not supported by some JAXP implementations by JIRA jira@apache.org
3
by JIRA jira@apache.org
[VOTE] Apache Tika 0.5 release candidate #1 by Mattmann, Chris A (3...
10
by Jukka Zitting
[jira] Created: (TIKA-325) tika-parent/pom.xml missing <inceptionYear>2007</inceptionYear> by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Created: (TIKA-326) Map javax.imageio.IIOException to TikaException by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-320) Allow disabling language detection in AutoDetectParser by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (TIKA-322) Improve encoding detection speed and accuracy by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-209) Language detection is weak. by JIRA jira@apache.org
12
by JIRA jira@apache.org
Build failed in Hudson: Tika-trunk #217 by Apache Hudson Server
1
by Apache Hudson Server
Build failed in Hudson: Tika-trunk » Apache Tika parent #217 by Apache Hudson Server
1
by Apache Hudson Server
Hudson build became unstable: Tika-trunk #213 by Apache Hudson Server
3
by Apache Hudson Server
Hudson build became unstable: Tika-trunk » Apache Tika parsers #213 by Apache Hudson Server
3
by Apache Hudson Server
Build Unstable by Mattmann, Chris A (3...
0
by Mattmann, Chris A (3...
Parse context - class or map? by Jukka Zitting
5
by Jukka Zitting
Tika facade - static or not by Jukka Zitting
8
by Mattmann, Chris A (3...
[jira] Created: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (TIKA-319) HtmlParser - use encoding hint only if charset is supported by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Commented: (TIKA-94) Speech recognition by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-275) Parse context by JIRA jira@apache.org
1
by JIRA jira@apache.org
0.5 release by Mattmann, Chris A (3...
0
by Mattmann, Chris A (3...
[jira] Created: (TIKA-314) Initial support for JPEG EXIF metadata extraction by JIRA jira@apache.org
8
by JIRA jira@apache.org