Profile cover photo
Profile photo
Carrot Search
20 followers -
Document clustering and visualization software
Document clustering and visualization software

20 followers
About
Posts

Lingo4G 1.8.0: faster and more robust indexing

We're pleased to announce the 1.8.0 release of Lingo4G, which comes with:

* More robust index storage format. We changed the index storage format to avoid potential problems with non-atomic index updates and to permit optimizations that make indexing faster..

* Incremental indexing improvements. Lingo4G server can now be stared with an empty index and hot-reloaded once documents get indexed. Incremental addition of documents has been improved to avoid long pauses caused by merging of large index segments.

* More efficient stop label extraction. Identification of common, meaningless phrases has been rewritten and is now much more efficient and memory-friendly.

Additionally, the 1.8.0 release comes with a number of small improvements and bug fixes.


Release notes:
https://get.carrotsearch.com/lingo4g/1.8.0/doc/releases.html#release-1.8.0


Download:
Upload your license file at https://secure.carrotsearch.com for download links.


Happy clustering!
Add a comment...

We'd like to announce the 1.7.1 release of Lingo4G with a number of bug fixes related to, among others, XML results export and launching Lingo4G under Cygwin.

Release notes:
https://get.carrotsearch.com/lingo4g/1.7.1/doc/releases.html#release-1.7.1

Download:
Upload your license file at https://secure.carrotsearch.com for download links.

Happy clustering!
Add a comment...

Lingo4G 1.7.0: scaling to terabyte-sized collections

We're pleased to announce the 1.7.0 release of Lingo4G, which comes with:

* Increased scalability of indexing and analysis. This release is about scaling Lingo4G to terabyte-sized data sets. With Lingo4G 1.7.0 you can significantly speed up indexing and analysis of such large collections.

* US Patent and Trademark Office data set. You can test the new scalability features with the new built-in data set covering more than 8 million patent grant and application documents, spanning nearly 500 GB of text.

* Label fetching rewritten. Release 1.7.0 improves the process of fetching labels describing the analyzed documents to better eliminate boiler-plate labels and increase performance when analyzing small subsets of very large indices.

On top of the above, version 1.7.0 makes the index smaller by 20-30%, adds the possibility to delete documents from the index, adds a number of minor REST API clean-ups and bug fixes.

Release notes:
https://get.carrotsearch.com/lingo4g/1.7.0/doc/releases.html#release-1.7.0

Download:
Upload your license file at https://secure.carrotsearch.com for download links.

Happy clustering!
Add a comment...

Lingo3G 1.16.0 released: Carrot2 framework update for Java 9+ compatibility.

Full release notes:
https://download.carrotsearch.com/lingo3g/1.16.0/manual#section.release-1.16.0
Add a comment...

Post has attachment
Dear All,

After a long stretch of work, Lingo4G 1.6.0 is finally available with two major new features:

* Incremental indexing. Starting with this release, you can add and update documents in the index without re-processing the whole collection. The newly added or modified documents can be included in analyses without restarting Lingo4G REST API.

* Spatial visualizations of document sets. Version 1.6.0 introduces the document embedding feature, which places documents in 2d space in such a way that textually-similar documents are close to each other. Based on document embedding, Lingo4G Explorer introduces the document map view, a tool for interactive visualization and exploration of document collections.

* Performance and functional improvements. New index storage format comes with indexing speed improvements and limits the number of passes over the document source's documents. Lingo4G Explorer now offers the document summary view, in-line help for parameters and many other improvements.

On top of the above, the 1.6.1 release improves reloading REST API when incremental indexing takes place, increase the performance of label coverage computation and tunes logging.

Release notes:
https://get.carrotsearch.com/lingo4g/1.6.0/doc/releases.html#release-1.6.0
https://get.carrotsearch.com/lingo4g/1.6.1/doc/releases.html#release-1.6.1

Download:
Upload your license file at https://secure.carrotsearch.com for download links.

Happy clustering!
Photo
Add a comment...

Lingo4G 1.5.2 released with a critical bug fix for label exclusion dictionaries | http://mailchi.mp/carrotsearch/l4g_release-1_5_2
Add a comment...

We'd like to announce the 1.5.1 release of Lingo4G with the following Lingo4G Explorer bug fixes:

Highlighting of selected document clusters. Lingo4G Explorer 1.5.0 might highlight a cluster different than the one the user clicked to select.

Lingo4G Explorer in Internet Explorer 11. Version 1.5.1 adds a special meta tag to prevent IE11 from switching to "compatibility mode" when accessing Lingo4G Explorer from an intranet address.


Release notes:
https://get.carrotsearch.com/lingo4g/1.5.1/doc/#release-1.5.1

Download:
Upload your license file at https://secure.carrotsearch.com for download links.

Happy clustering!
Add a comment...

FoamTree 3.4.5 released with better responsiveness with large hierarchies and bug fixes | Release notes: https://get.carrotsearch.com/foamtree/3.4.5/api/#release- 3.4.5 | Download: https://secure.carrotsearch.com
Add a comment...

We're pleased to announce the 1.5.0 release of Lingo4G. Highlights of this release include:

Hierarchical document clustering. Lingo4G can now create hierarchical arrangements of documents. The arrangements are presented in Lingo4G Explorer as a two-level structure of clusters sets containing document clusters.

Analysis scope size limit has been exposed in Lingo4G Explorer, so that you can process only the fraction of the (possibly many) documents matching your query.

Release notes: https://get.carrotsearch.com/lingo4g/1.5.0/doc/#release-1.5.0

Download: Upload your license file at https://secure.carrotsearch.com for download links.

Happy clustering!
Add a comment...


We're pleased to announce the 1.4.0 release of Lingo4G. Highlights of this release include:

Cancelling in-progress analyses. Lingo4G REST API now supports cancelling of in-progress tasks. You can use the new feature to avoid computing results that will no longer be needed because, for example, the user chose to cancel a long-running in-progress analysis.

Caching of source documents to improve indexing of documents from high-overhead sources (decompression, network access, data parsing).


Release notes:
https://get.carrotsearch.com/lingo4g/1.4.0/doc/#release-1.4.0

Download:
Upload your license file at https://secure.carrotsearch.com for download links.

Happy clustering!
Add a comment...
Wait while more posts are being loaded