In enterprise search, quality, usability and style are as important as relevancy of results and performance to engage your users right from the start.
Let’s take a look at typical scale-out scenarios that become relevant when implementing enterprise environments with Fabasoft Mindbreeze.
Starting with Entry-Level
The first configuration consists of a single Fabasoft Mindbreeze Appliance. It allows you to crawl all your data sources, analyze and process all your documents and records and provides comprehensive unified information access to all your users.
Don’t get me wrong, with “entry-level”: you can go all up to at least 6 million documents with one entry-level appliance. However, in large-scale enterprise environments, your data volume can easily blow this limit. So we go to the next level.
Taming huge data volume by automatic data partitioning
So what can you do if your data sources hold so many documents and records, that one server is not enough?
The scale-out configuration aims at large-scale enterprise infrastructures with terabytes of data from millions of documents and records. To support such scenarios, you only need to add additional appliance nodes.
Fabasoft Mindbreeze will automatically and uniformly distribute the documents over all available appliance nodes, thus scaling-out to match the data volume while maintaining excellent query performance.
You need highest availability? Index Replication is the way to go
Another axis of scale-out is targeted in the third configuration: High Availability End users demand their systems to be available 24/7 and to guarantee shortest query times.
Fabasoft Mindbreeze achieves that via index replication. By scaling-out an existing appliance infrastructure, the replica nodes inherit the data sources crawling capabilities as well as the queryable index of the original nodes.
Ultimate query performance through designated query nodes
Complex document analysis and document processing demands resources.
On the other hand, Web 2.0 applications and Cloud Computing demand high
availability with exponential growing numbers of concurrent users as the mission critical requirement.
For ultimate query performance, the fourth configuration decouples data source crawler (producer) nodes completely from query (consumer) nodes:
- Data sources can be crawled independently of the query nodes in terms of time and resource consumption.
- The indices produced by the crawler nodes can be replicated to the query nodes and made searchable without any downtime, using our Hot Reopen technology.
- Combined with a load-balancer, the appliances hosting the query nodes can be
scaled independently of the crawler nodes, to keep up with the growing number of concurrent users.