lucene: add a field to index, based on html meta tag

I have a question about building an index file.

I've been using the Lucene demo from http://lucene.apache.org/java/2_1_0/demo.html

I want to add a field named "category" to my HTML documents, and ideally I would like to do this by reading a meta tag in the HTML document, so that when searching I can use a term like "category:spare_parts" to limit the hits returned.

E.g., when indexing the file 123456789.html the tag <meta name="category" content="spare_parts"> would put the value "spare parts" in the "category" field.

So how could I do this?

Regards,
Keith.
Reply Keith 10/23/2007 1:48:56 AM
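
One possible way to approach the first half of the question (getting the value out of the meta tag) is sketched below using only the Java standard library. The class name MetaTagExtractor and the method extractMeta are hypothetical names chosen for illustration; they are not part of Lucene or its demo. The regular expressions are a simplification and will not handle every HTML edge case (for example, a ">" inside a quoted attribute value), so a real HTML parser may be a better fit.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not part of Lucene): pulls the content of a named <meta> tag.
final class MetaTagExtractor {
    // Matches a whole <meta ...> tag and captures its attribute text.
    private static final Pattern META_TAG =
        Pattern.compile("<meta\\b([^>]*)>", Pattern.CASE_INSENSITIVE);

    // Matches one attribute: name="value", name='value', or name=value.
    private static final Pattern ATTRIBUTE = Pattern.compile(
        "([A-Za-z][\\w-]*)\\s*=\\s*(?:\"([^\"]*)\"|'([^']*)'|([^\\s\"'>]+))");

    // Returns the content attribute of the first <meta> tag whose name
    // attribute equals metaName (ignoring case), or null if there is none.
    static String extractMeta(String html, String metaName) {
        Matcher tag = META_TAG.matcher(html);
        while (tag.find()) {
            Map<String, String> attrs = new HashMap<>();
            Matcher attr = ATTRIBUTE.matcher(tag.group(1));
            while (attr.find()) {
                String value = attr.group(2) != null ? attr.group(2)
                             : attr.group(3) != null ? attr.group(3)
                             : attr.group(4);
                attrs.put(attr.group(1).toLowerCase(Locale.ROOT), value);
            }
            if (metaName.equalsIgnoreCase(attrs.get("name"))) {
                return attrs.get("content");
            }
        }
        return null;
    }

    public static void main(String[] args) {
        String html = "<html><head>"
            + "<meta name=\"category\" content=\"spare_parts\">"
            + "</head><body>...</body></html>";
        System.out.println(extractMeta(html, "category"));
    }
}
```

For the second half, assuming the Lucene 2.1 Field API, the extracted value could then be added where the demo builds its Document, with something along the lines of doc.add(new Field("category", value, Field.Store.YES, Field.Index.UN_TOKENIZED)). Indexing the field untokenized keeps "spare_parts" as a single term; whether a query such as category:spare_parts matches it also depends on how the analyzer used at query time treats the underscore. It may also be worth checking the source of the demo's HTML parser, which may already expose the page's meta tags.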


