Apache Tika - Development

This forum is an archive for the mailing list tika-dev@lucene.apache.org. Messages posted here will be sent to this mailing list.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Topics (18031)
Topic / Replies / Last Post
[jira] Created: (TIKA-207) MS word doc containing tracked changes produces incorrect text by JIRA jira@apache.org
0
by JIRA jira@apache.org
Release emails by Grant Ingersoll-2
2
by Grant Ingersoll-2
[VOTE] Apache Tika 0.3 by Mattmann, Chris A (3...
5
by Jukka Zitting
Use of general@l.a.o for... by Grant Ingersoll-2
0
by Grant Ingersoll-2
[jira] Created: (TIKA-179) Tika stand alone CLI --text output mostly not working, other output formats are fine by JIRA jira@apache.org
8
by JIRA jira@apache.org
[jira] Updated: (TIKA-61) Add namespaces to our metadata keys by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (TIKA-79) Mime type detection from file header appears to be failing. by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Resolved: (TIKA-79) Mime type detection from file header appears to be failing. by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (TIKA-80) Utility method in MimeUtils to perform full mime resolution using all available strategies by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Resolved: (TIKA-69) ParseUtils methods need to support Metadata by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (TIKA-74) Test Resources should be loaded by the class loader (e.g. getResourceAsStream()). by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (TIKA-79) Mime type detection from file header appears to be failing. by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (TIKA-121) MimeType.clean method no longer exists as a capability by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-205) Factor out met keys in MimeTypesReader representing XML tag/attr names by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-194) Support java regular expressions in glob pattern spec for mime repo by JIRA jira@apache.org
1
by JIRA jira@apache.org
Board report time by Jukka Zitting
1
by Mattmann, Chris A (3...
0.3 release (Was: --text / -t CLI option produces no output) by Jukka Zitting
2
by Jukka Zitting
[jira] Resolved: (TIKA-152) Support for Office XML files by JIRA jira@apache.org
0
by JIRA jira@apache.org
--text / -t CLI option produces no output by Aldus Whitfield
7
by Mattmann, Chris A (3...
ParsingReader and PackageParser by Jonathan Koren
3
by Jukka Zitting
Welcome Jukka Zitting to the Lucene PMC by Grant Ingersoll-2
2
by David Meikle
Reading metadata without downloading entire file by Nick Lothian
4
by Nick Lothian
GSOC by Grant Ingersoll-2
0
by Grant Ingersoll-2
[jira] Commented: (TIKA-147) Add Flash parser by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-186) Refactor the MS Office property names to MSOffice.java by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Commented: (TIKA-152) Support for Office XML files by JIRA jira@apache.org
0
by JIRA jira@apache.org
Tika Issue by AmardeepSingh
1
by Jana, Kumar Raja
tika prob by shyamgosavi
1
by Jukka Zitting
Using standard XMP schemas for image and audio metadata by Jukka Zitting
8
by Jonathan Koren
ApacheCon EU Lucene promotion by Grant Ingersoll-2
0
by Grant Ingersoll-2
[jira] Commented: (TIKA-152) Support for Office XML files by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-197) Microsoft Outlook (msg) files get parsed multiple times by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (TIKA-202) Warnings during Site generation by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (TIKA-201) Extract lyrics and other text from MIDI audio files by JIRA jira@apache.org
1
by JIRA jira@apache.org
Hudson build became unstable: Tika-trunk » Apache Tika #84 by Apache Hudson Server
1
by Apache Hudson Server