|
|
lucene: add a field to index, based on html meta tag
I have a question about building an index file.I've been using the Lucene demo fromhttp://lucene.apache.org/java/2_1_0/demo.htmlI want to add a field named "category" to my HTML documents, and ideally Iwould like to do this by reading a meta tag in the HTML document, so thatwhen searching I can use a term like "category:spare_parts" to limit thehits returned.E.g., when indexing the file123456789.html the tag <meta name="category"content="spare_parts"> would put the value "spare parts" in the "category"field.So how could I do this?Regards,Keith.
|
|
0
|
|
|
|
Reply
|
Keith
|
10/23/2007 1:48:56 AM |
|
|
0 Replies
318 Views
(page loaded in 0.018 seconds)
Similiar Articles: how insert tag? - comp.text.pdflucene: add a field to index, based on html meta tag - comp.lang ... I have a question about building an index file.I've been using the Lucene demo fromhttp://lucene ... How to insert CDATA in an XmlBean tag! - comp.lang.java.programmer ...lucene: add a field to index, based on html meta tag - comp.lang ... I suppose one could add a specific tag to Vector to allow it to ... ... nor meant literally, but just ... Newline after xml tag - comp.soft-sys.sasNote: If ... between paragraphs that will stay without ... lucene: add a field to index, based on html meta tag ... Newline after xml tag - comp.soft-sys.sas... code ... How to get newline in HTML? Without BR - comp.lang.javascript ...Note: If ... between paragraphs that will stay without ... lucene: add a field to index, based on html meta tag ... Newline after xml tag - comp.soft-sys.sas... code ... How best to detect duplicate values in a column? - comp.databases ...... investment that was really just a *tag ... > So can removing an index. Or adding another field. ... removing duplicate records based on specific field. - comp ... Const constructor - comp.lang.c++.moderatedMy assumption based on your words is that you would ... f 2/m{moveto}d -1 r 230 350 m 0 1 179{ 1 index show ... I suppose one could add a specific tag to Vector to allow it to ... Sampling: What Nyquist Didn't Say, and What to Do About It - comp ...> Again, pdfLaTeX adds the tags /you/ want it to add, and ... Harmonics from SCR (or triac) based light dimmers likely ... <frown> It's disturbing (in any field) when people *think ... Getting Started with Lucene Setup | Lucid Imagination... Lucene setup with how to index, analyze and search your content easily with java based ... HTML <meta> tag ... html", Field.Store.YES, Field.Index.NOT_ANALYZED)); doc.add ... Tag Archives: lucene - « Captain CodemanSoftware DeveloperWhen you open the index.html ... Term(field, term.GetTerm())); termsQuery.Add(termQuery ... demonstrate using the Lucene.NET index to find similar items based ... 7/24/2012 2:51:07 AM
|
|
|
|
|
|
|
|
|