On 10/20/2010 01:57 AM, [Ricardo Rodriguez] eBioTIC. wrote:
Hi,
[Ricardo Rodriguez] eBioTIC. wrote:
> Hi!
>
> Please, do we have methods to extract metadata from attached files in
> formats like PDF, TIFF, DOC, ...? Such data sometimes relies on user
> input, but other times, as with size and channel information in TIFF
> files, it is built-in data that could be really useful for designing
> scripts to show pictures.
>
> Thanks!
>
> Ricardo
>
There's nothing available in the standard XWiki core. You can write your
own module that handles metadata extraction, using some external
libraries. For the moment, I don't think that this is something
important for the core, so unless someone provides a patch (or sponsors
one of the developers to do it), it is unlikely to be implemented any
time soon.
http://tinyurl.com/2v3gdj5
http://dev.xwiki.org/xwiki/bin/view/Drafts/MetadataLuceneTutorial
To the best of my understanding, this message is not about the same thing
I was asking in my brief message, but could it be somehow related? I'm of
course extremely far from Jean-Vivien's skills! Still, could the two
topics, adding/indexing metadata and indexing metadata that already
exists in attached documents, be connected? Thanks!
It is related in the sense that the approach in that tutorial can index
this metadata once another module has extracted it, so that you can query
it using the Lucene plugin. For example, once you get the dimensions of a
TIFF image, you can store this information as fields of the indexed item,
and then query all images with a width between 600 and 800 pixels.
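
To make the idea concrete, here is a rough sketch in plain Python. It is
not XWiki or Lucene code. It only assumes the classic TIFF layout (an
8-byte header, then a first IFD made of 12-byte entries), and the names
tiff_dimensions, make_tiff, index and width_between are made up for
illustration. The dictionary stands in for the Lucene index, and
make_tiff only fabricates header bytes so the example runs without a
real attachment.

```python
import struct

TAG_IMAGE_WIDTH = 256
TAG_IMAGE_LENGTH = 257  # TIFF's name for the image height


def tiff_dimensions(data: bytes) -> dict:
    """Read width and height from the first IFD of a classic TIFF file."""
    order = {b"II": "<", b"MM": ">"}.get(data[:2])  # little or big endian
    if order is None:
        raise ValueError("not a TIFF file")
    magic, ifd_offset = struct.unpack_from(order + "HI", data, 2)
    if magic != 42:
        raise ValueError("not a classic TIFF file")
    (entry_count,) = struct.unpack_from(order + "H", data, ifd_offset)
    fields = {}
    for i in range(entry_count):
        pos = ifd_offset + 2 + 12 * i
        tag, ftype, _count = struct.unpack_from(order + "HHI", data, pos)
        if tag not in (TAG_IMAGE_WIDTH, TAG_IMAGE_LENGTH):
            continue
        # A SHORT (type 3) sits in the first 2 bytes of the 4-byte value
        # slot; a LONG (type 4) fills the whole slot.
        fmt = "H" if ftype == 3 else "I"
        (value,) = struct.unpack_from(order + fmt, data, pos + 8)
        fields["width" if tag == TAG_IMAGE_WIDTH else "height"] = value
    return fields


def make_tiff(width: int, height: int) -> bytes:
    """Hypothetical helper: minimal little-endian header + IFD, no pixels."""
    entries = (struct.pack("<HHII", TAG_IMAGE_WIDTH, 4, 1, width)
               + struct.pack("<HHII", TAG_IMAGE_LENGTH, 4, 1, height))
    return (b"II" + struct.pack("<HI", 42, 8)      # header, IFD at offset 8
            + struct.pack("<H", 2) + entries       # two IFD entries
            + struct.pack("<I", 0))                # no further IFD


# Stand-in for the index: attachment name -> extracted metadata fields.
index = {
    name: tiff_dimensions(make_tiff(w, h))
    for name, w, h in [("a.tiff", 640, 480),
                       ("b.tiff", 1024, 768),
                       ("c.tiff", 800, 600)]
}


def width_between(index: dict, low: int, high: int) -> list:
    """Names of indexed items whose width lies in [low, high]."""
    return sorted(n for n, f in index.items() if low <= f["width"] <= high)


print(width_between(index, 600, 800))
```

With Lucene itself, the equivalent query would be a range on the width
field, something like width:[600 TO 800] in the classic query syntax.
Whether that range is compared numerically or as plain strings depends on
how the field was indexed, so treat it as a starting point only.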
--
Sergiu Dumitriu
http://purl.org/net/sergiu/