Lucene

The Apache Lucene project develops open-source search software In a variety of languages.  
The flagship sub-project is "Lucene - Java", many questions people have about using the "Lucene Library" can be best addressed on the java-users mailing list.
This is the Lucene mailing list archive and forum.
1 ... 8259826082618262826382648265 ... 9319
Topics (326132)
Replies Last Post Views Sub Forum
Nutch crawls parent directories and ignores the url filters added to prevent this in crawl-urlfilter.txt by Godmar Back
0
by Godmar Back
Nutch - User
[Nutch Wiki] Update of "FAQ" by GodmarBack by Apache Wiki
0
by Apache Wiki
Nutch - Dev
[Nutch Wiki] Update of "FAQ" by GodmarBack by Apache Wiki
0
by Apache Wiki
Nutch - Dev
[Nutch Wiki] Update of "FAQ" by GodmarBack by Apache Wiki
0
by Apache Wiki
Nutch - Dev
[jira] Created: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable by JIRA jira@apache.org
7
by JIRA jira@apache.org
Nutch - Dev
[Nutch Wiki] Update of "FAQ" by GodmarBack by Apache Wiki
0
by Apache Wiki
Nutch - Dev
[jira] Created: (TIKA-358) Auto-detection of HTML fails with common auto-generated template by JIRA jira@apache.org
1
by JIRA jira@apache.org
Apache Tika - Development
Doc/vector to cluster relevance? by Bogdan94202
0
by Bogdan94202
Mahout User List
[jira] Created: (SOLR-1690) JSONKeyValueTokenizerFactory -- JSON Tokenizer by JIRA jira@apache.org
6
by JIRA jira@apache.org
Solr - Dev
Re: Re: Dedup remove all duplicates by Pascal Dimassimo
0
by Pascal Dimassimo
Nutch - User
[jira] Created: (LUCENE-860) site should call project "Lucene Java", not just "Lucene" by JIRA jira@apache.org
7
by JIRA jira@apache.org
Lucene - Java Developer
[jira] Created: (LUCENE-2035) TokenSources.getTokenStream() does not assign positionIncrement by JIRA jira@apache.org
10
by JIRA jira@apache.org
Lucene - Java Developer
Lucene Java 2.9.2 by George Aroush
9
by Mark Miller-3
Lucene - Java Developer
crawl-urlfilter.txt & regex-urlfilter.txt by Ken Ken
3
by MilleBii
Nutch - User
Dedup remove all duplicates by Pascal Dimassimo
1
by Andrzej Białecki-2
Nutch - User
Solr Cell - PDFs plus literal metadata - GET or POST ? by Ross Keatinge
3
by Ross Keatinge
Solr - User
Extracting Essence of Page by filtering Advertisements by Ted Yu-3
0
by Ted Yu-3
Nutch - User
[jira] Created: (NUTCH-655) Injecting Crawl metadata by JIRA jira@apache.org
15
by JIRA jira@apache.org
Nutch - Dev
Compiling mahout on OS X by Markus Weimer-3
2
by IsabelDrost
Mahout User List
performance question by Steve A.
11
by A. Steven Anderson
Solr - User
build/nutch.xml by Ken Ken
2
by Godmar Back
Nutch - User
Re: Nutch & Lucene Installation Instructions by Mattmann, Chris A (3...
0
by Mattmann, Chris A (3...
Nutch - User
How do you check a field has been indexed correctly if not stored ? by Paul Taylor-2
5
by Erick Erickson
Lucene - Java Users
Clustering techniques, tips and tricks by Grant Ingersoll-2
33
by Grant Ingersoll-2
Mahout User List
Switching from Store.YES to Store.NO by Babak Farhang-2
5
by Michael McCandless-2
Lucene - Java Users
[jira] Created: (SOLR-1705) Move QueryConvertor into SpellCheckComponent configuration by JIRA jira@apache.org
2
by JIRA jira@apache.org
Solr - Dev
Rules engine and Solr by Avlesh Singh
6
by Avlesh Singh
Solr - User
Methods for Naming Clusters by Paul Ingles
38
by Dawid Weiss-2
Mahout User List
Solr Replication Questions by Giovanni Fernandez-K...
1
by Noble Paul നോബിള്‍ ...
Solr - User
replicating extension JARs by Ryan Kennedy-3
1
by Noble Paul നോബിള്‍ ...
Solr - User
[jira] Created: (SOLR-1212) TestNG Test Case by JIRA jira@apache.org
8
by JIRA jira@apache.org
Solr - Dev
Threads, revisited by Marvin Humphrey
2
by Marvin Humphrey
lucy dev
NPE in MoreLikeThis referenced doc not found and debugQuery=True by David Stuart
2
by hossman
Solr - Dev
SolrException.ErrorCode by Grant Ingersoll-2
1
by hossman
Solr - Dev
Listing Terms by Ascending IDF value . . ? by Christopher Ball-2
7
by Christopher Ball-2
Solr - User
1 ... 8259826082618262826382648265 ... 9319