Java全文搜索框架,Lucene 5.3.0 發布

jopen 9年前發布 | 9K 次閱讀 Lucene

Lucene 是 apache 軟件基金會一個開放源代碼的全文檢索引擎工具包,是一個全文檢索引擎的架構,提供了完整的查詢引擎和索引引擎,部分文本分析引擎。 Lucene 的目的是為軟件開發人員提供一個簡單易用的工具包,以方便的在目標系統中實現全文檢索的功能,或者是以此為基礎建立起完整的全文檢索引擎。

Lucene 最初是由 Doug Cutting 所撰寫的,是一位資深全文索引/檢索專家,曾經是V-Twin 搜索引擎的主要開發者,后來在 Excite 擔任高級系統架構設計師,目前從事于一些 INTERNET 底層架構的研究。他貢獻出 Lucene 的目標是為各種中小型應用程式加入全文檢索功能。

Lucene 5.3.0 發布,此版本值得關注的更新如下:

API 改進:PhraseQuery 和 BooleanQuery 不可變 

新特性:

  • Added a new org.apache.lucene.search.join.CheckJoinIndex class that can be used to validate that an index has an appropriate structure to run join queries 

    </li>

  • Added a new BlendedTermQuery to blend statistics across several terms 

    </li>

  • New common suggest API that mirrors Lucene's Query/IndexSearcher APIs for Document based suggester. 

    </li>

  • IndexWriter can now be initialized from an already open near-real-time or non-NRT reader 

    </li>

  • Add experimental range tree doc values format and queries, based on a 1D version of the spatial BKD tree, for a faster and smaller alternative to postings-based numeric and binary term filtering.  Range trees can also handle values larger than 64 bits.

    </li> </ul>

    Geo 相關特性和改進:

    • Added GeoPointField, GeoPointInBBoxQuery, GeoPointInPolygonQuery for simple "indexed lat/lon point in bbox/shape" searching

      </li>

    • Added experimental BKD geospatial tree doc values format and queries, for fast "bbox/polygon contains lat/lon points"

      </li>

    • Use doc values to post-filter GeoPointField hits that fall in boundary cells, resulting in smaller index, faster searches and less heap used for each query

      </li> </ul>

      優化:

      • Reduce RAM usage of FieldInfos, and speed up lookup by number, by using an array instead of TreeMap except in very sparse cases

        </li>

      • Faster intersection of the terms dictionary with very finite automata, which can be generated eg. by simple regexp queries

        </li>

      • Various bugfixes and optimizations since the 5.2.0 release.

        </li> </ul>

        下載:http://www.apache.org/dyn/closer.cgi/lucene/java/5.3.0  


         本文由用戶 jopen 自行上傳分享,僅供網友學習交流。所有權歸原作者,若您的權利被侵害,請聯系管理員。
         轉載本站原創文章,請注明出處,并保留原始鏈接、圖片水印。
         本站是一個以用戶分享為主的開源技術平臺,歡迎各類分享!