<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Lucene on Saleem Ansari</title>
    <link>/tags/lucene/</link>
    <description>Recent content in Lucene on Saleem Ansari</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <copyright>(c) 2024 Saleem Ansari</copyright>
    <lastBuildDate>Wed, 17 Apr 2013 00:00:00 +0000</lastBuildDate>
    <atom:link href="/tags/lucene/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Understanding big Lucene index by inspecting a portion of it</title>
      <link>/2013/04/17/understanding-big-lucene-index-by-inspecting-a-portion-of-it/</link>
      <pubDate>Wed, 17 Apr 2013 00:00:00 +0000</pubDate>
      <guid>/2013/04/17/understanding-big-lucene-index-by-inspecting-a-portion-of-it/</guid>
      <description>I was wondering if I could take a sample from several huge Lucene indexes and inspect it with Lukeall on my machine. I quickly realized that copying such indexes over the network would be time-consuming.&#xA;First I googled for a ready-made solution that would let me copy only a few documents from the whole index into a separate (small) index. That way I could quickly understand the document structure.</description>
    </item>
    <item>
      <title>Indexing the documents stored in a database using Apache Solr and Apache Tika</title>
      <link>/2013/02/04/indexing-the-documents-stored-in-a-database-using-apache-solr-and-apache-tika/</link>
      <pubDate>Mon, 04 Feb 2013 00:00:00 +0000</pubDate>
      <guid>/2013/02/04/indexing-the-documents-stored-in-a-database-using-apache-solr-and-apache-tika/</guid>
      <description>Indexing the documents stored in a database&#xA;Outline:&#xA;Set up a MySQL database [1] containing documents (PDF/DOC/HTML, etc.). Set up Apache Solr / Tika. Import the documents just by hitting an import URL. NOTE: Also check the update note at the end of this post.&#xA;These steps were done on my machine running Fedora 17. The commands can be easily converted for other distributions.&#xA;Set up MySQL database with documents&#xA;Install MySQL Server:</description>
    </item>
    <item>
      <title>Comparison on Lucene Solr and NoSQL</title>
      <link>/2011/07/09/comparison-on-lucene-solr-and-nosql/</link>
      <pubDate>Sat, 09 Jul 2011 00:00:00 +0000</pubDate>
      <guid>/2011/07/09/comparison-on-lucene-solr-and-nosql/</guid>
      <description>Comparison on Lucene/Solr and NoSQL&#xA;http://stackoverflow.com/questions/3215029/nosql-mongodb-vs-lucene-or-solr-as-your-database NoSQL, Lucene and Solr&#xA;http://www.lucidimagination.com/blog/2010/04/30/nosql-lucene-and-solr/ For The Guardian, Solr is the new database&#xA;http://www.lucidimagination.com/blog/2010/04/29/for-the-guardian-solr-is-the-new-database/ </description>
    </item>
    <item>
      <title>Is NoSQL database an alternative for a search engine?</title>
      <link>/2011/02/27/is-nosql-database-an-alternative-for-a-search-engine/</link>
      <pubDate>Sun, 27 Feb 2011 00:00:00 +0000</pubDate>
      <guid>/2011/02/27/is-nosql-database-an-alternative-for-a-search-engine/</guid>
      <description>I have been thinking about this question:&#xA;Is NoSQL database an alternative for a search engine?&#xA;I think I just found an answer here.&#xA;Let's talk about some terms and definitions first.&#xA;NoSQL - Not only SQL means that a NoSQL database differs from an RDBMS in some way.&#xA;IR - Information Retrieval is the science of searching documents, their metadata, and retrieval.&#xA;Here we compare a NoSQL storage engine, MongoDB, and an Information Retrieval library, Apache Lucene.</description>
    </item>
    <item>
      <title>PyLucene on Fedora 14</title>
      <link>/2011/02/25/pylucene-on-fedora-14/</link>
      <pubDate>Fri, 25 Feb 2011 00:00:00 +0000</pubDate>
      <guid>/2011/02/25/pylucene-on-fedora-14/</guid>
      <description>I couldn&amp;rsquo;t install pylucene simply with the following command:&#xA;yum install pylucene Nor did the following work:&#xA;easy_install pylucene Nor the following :-(&#xA;pip-python install pylucene So I had to build it myself. Here are the steps:&#xA;Install JCC&#xA;$ JCC_JDK=/usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0.x86_64 pip-python install jcc Download pylucene&#xA;wget -c http://apache.mirrors.pair.com//lucene/pylucene/pylucene-2.4.1-1-src.tar.gz tar zxf pylucene-2.4.1-1-src.tar.gz cd pylucene-2.4.1-1 Build and install http://lucene.apache.org/pylucene/documentation/install.html&#xA;pushd jcc # edit setup.py to match your environment JCC_JDK=/usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0.x86_64 python setup.py build sudo python setup.</description>
    </item>
  </channel>
</rss>
