Innauguro il Topic.
Postate qui problemi, soluzioni....
Io ora mi sto studiando il funzionamento di LUCENE e di WORDNET.
Pensavo di fare il progetto 2.III
Ciao!! Tanto per cominciare male...non riesco a far partire neanche le demo di lucene....
Cosa intende per {index-dir} nel comando "java........demo.IndexHTML -create -index {index-dir}? La directory che voglio indicizzare? Ossia la sotto directory di webapps di Tomcat? Oppure non sto capendo nulla? Inoltre devo creare io .../opt/lucene/index?
Grazie
A.3 Running the web application demo
The web demo is slightly involved to set up and run properly. You need a web container; our instructions are for Tomcat 5. The docs/demo.html documentation provides detailed instructions for setting up and running the web application, but you can also follow the steps provided here.
The index used by the web application differs slightly from that in the command-line demo. First, it restricts itself to indexing only .html, .htm, and .txt files. Each file it processes (including .txt files) is parsed using a custom rudimentary HTML parser. To build the index initially, execute IndexHTML:
[code type="markup"]java -cp lucene-1.4.2.jar;lucene-demos-1.4.2.jar
org.apache.lucene.demo.IndexHTML -create -index webindex docs
.
.
.
adding docs/resources.html
adding docs/systemproperties.html
adding docs/whoweare.html
Optimizing index...
7220 total milliseconds[/code]
The -index webindex switch sets the location of the index directory. In a moment, you’ll need the full path to this directory to configure the web application. The final docs argument to IndexHTML is the directory tree to index. The –create switch creates an index from scratch. Remove this switch to update the index with files that have been added or changed since the last time the index was built.
Qualcuno ha agito diversamente per indicizzare il file cran.all su Lucene?
Qualcuno ha agito diversamente per indicizzare il file cran.all su Lucene?
Io ho creato un mio parser.
Non è difficile.
Ci vuole solo molta pazienza... :shock: