Apache Tika - Development

This forum is an archive for the mailing list tika-dev@lucene.apache.org (more options) Messages posted here will be sent to this mailing list.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
1 ... 599600601602603604605 ... 609
Topics (21301)
Replies Last Post Views
[jira] Created: (TIKA-141) Mime Content Type detection of a web document from its URL. by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (TIKA-139) Add a composite parser by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (TIKA-92) Image metadata extraction with Sanselan by JIRA jira@apache.org
2
by JIRA jira@apache.org
New Tika components added in JIRA and issues classified by chrismattmann
1
by Jukka Zitting
[jira] Created: (TIKA-61) Add namespaces to our metadata keys by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Created: (TIKA-94) Speech recognition by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-98) Add ApertureParser by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-93) OCR support by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-95) Pluggable magic header detectors by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-87) MimeTypes should allow modification of MIME types by JIRA jira@apache.org
2
by JIRA jira@apache.org
Tika board report due by Bertrand Delacretaz-...
2
by Bertrand Delacretaz-...
New Tika committer by Jukka Zitting
3
by Bertrand Delacretaz-...
[jira] Created: (TIKA-113) Metadata (such as title) should not be part of content by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (TIKA-138) Better HTML parsing by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Created: (TIKA-136) Exception during command line calling by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (TIKA-137) Move CLI and GUI into separate sub-modules by JIRA jira@apache.org
2
by JIRA jira@apache.org
Tika on the Fast Feather Track by Jukka Zitting
3
by Jukka Zitting
TIKA-134 by Karl Heinz Marbaise-...
1
by Jukka Zitting
[jira] Created: (TIKA-134) mvn package does not produce packages for bin/src by JIRA jira@apache.org
2
by JIRA jira@apache.org
TIKA - 136 by Karl Heinz Marbaise-...
0
by Karl Heinz Marbaise-...
TIKA 135 by Karl Heinz Marbaise-...
0
by Karl Heinz Marbaise-...
[jira] Created: (TIKA-133) TeeContentHandler constructor should use varargs by JIRA jira@apache.org
1
by JIRA jira@apache.org
Links in documents by thorsten
6
by Jukka Zitting-3
What kind of files do you support? by Karl Heinz Marbaise-...
3
by Jukka Zitting-3
Streaming vs. other features in parsers by Jukka Zitting-3
4
by Niall Pemberton
[jira] Created: (TIKA-128) HTML parser should produce XHTML SAX events by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-131) Lazy XHTML prefix generation by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-130) self-or-descendant axis does not match self in streaming XPath by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (TIKA-129) node() support for the streaming XPath utility by JIRA jira@apache.org
1
by JIRA jira@apache.org
Metadata design by Jukka Zitting
13
by Jérôme Charron-2
[jira] Created: (TIKA-127) Add support for Visio files by JIRA jira@apache.org
1
by JIRA jira@apache.org
Documentation by thorsten
2
by thorsten
[jira] Created: (TIKA-122) Use Commons IO 1.4 by JIRA jira@apache.org
1
by JIRA jira@apache.org
Working with unreleased POI code by Jukka Zitting
3
by Jukka Zitting
[jira] Created: (TIKA-125) Pass Locale information to parsers by JIRA jira@apache.org
1
by JIRA jira@apache.org
1 ... 599600601602603604605 ... 609