Hi!
Thomas Höschele wrote:
Hallo,
checked broken attachments. There some put these are picture files only,
which should not be affected by the lucene engine.
I've seen issues with PDF files including only images. I'd fix all and
every problem with attachments before debugging problems with Lucene. I
guess you can configure Lucene to exclude some file extensions, but I
don't now how to do that.
Also how can I make the rebuild with the scheduler, my
code does not seem to
work.
My Search Administration don't work neither with Lucene as default
search engine. To force a rebuild of the index, simply delete its files
and restart your applications server/servlet container. About index
files location, xwiki.cfg reads:
#-# Lucene search engine
#-# Location where to place the lucene index files. The default is the
"lucene" subdirectory in the container's "work"
Please, could you tell something else about your installation? Thanks!
Thanks for your answer.
Thomas
-----Ursprüngliche Nachricht-----
Von: users-bounces(a)xwiki.org [mailto:users-bounces@xwiki.org] Im Auftrag von
[Ricardo Rodriguez] eBioTIC.
Gesendet: Donnerstag, 28. Oktober 2010 11:54
An: XWiki Users
Betreff: Re: [xwiki-users] Lucene File Indexing
Hi!
Thomas Höschele wrote:
Hallo,
I noticed that lucene is capable of Indexing various office files (excel,
word, outlook mails); however, this doesn't happen automatically.
When I add the file it does not get indexed no matter how long I wait.
When
I rebuild the Index via the administration
terminal the files get indexed.
Please, Thomas, could you give more details about your installation?
XWiki release? Servlet container/application server (Jetty, Tomcat,
GlassFish,..)? Database?
>
>
> Also I get an error in the logs (error getting content of attachment ... of
> doc ...).
>
>
Could you also check if you have broken attachments and those are the
ones causing these errors? You can use this:
http://code.xwiki.org/xwiki/bin/view/Snippets/AllBrokenAttachments
> Is this an Lucene Error or has it something todo with my configuration?
I had the same type of errors here (XE/XEM 2.4.1 running on a Suse Linux
10sp3 box with Apache Tomcat 5.5.27 and Java 1.5.0 ). All but a small
bunch where related with broken attachments. Among these, three Excel
spreadsheets with xlsb format. The rest, PDF files. You talk about
Microsoft Office files, let's see if we can reproduce the error!
I'll try in a new installation running the last XE snapshot ASAP. I'll
keept this thread posted. If you could confirm that some xlsb files are
causing problems in Lucene, I think it will be worth to create a new
Jira issue on this.
Thanks!
>
>
> Thomas
>
> _______________________________________________
> users mailing list
> users(a)xwiki.org
>
http://lists.xwiki.org/mailman/listinfo/users
>
--
Ricardo Rodríguez
CTO
eBioTIC.
Life Sciences, Data Modeling and Information Management Systems