Part – 1
World of Data
Most delicious food for every Tech or Tech based companies. Everyone wants to know their customers, their sales, their business. All those companies are gathering data. This data size getting larger each day.
We have data, petabytes of data. What to do with these? Well, it is simple, find information from there.
Ok, what information to find from there? Umm… before going to that point lets see..
What is Information
Your birthday, name, phone number, social security number, address, parents name, bought product, shoes you fit in, chips you like and… and.. the list is endless. But, all data is not information to everyone. An online bookstore like Rokomari may have use of your phone number, but your favourite pen brand is not a information to them (who knows, I might be proven wrong if they start to gift their customers their own favourite branded pen).
Companies have piles of information in their database. They just need to find the data on demand or when necessary. How to find? No problem at all, just put a “select like” query in MySQL and seat back resting your occiput on crossing fingers.
What is the catch here? Customers do not rest, individual can. Customers need info as they type, use backspace or type again. They do not have time to wait for a single moment, if they are kept in waiting then, they will move to other service provider who produces faster results than you, if you are in competitive business.
So, how to make finding information faster without making too much read operation on your database? No surprise, SOLR will help you to do so.
SOLR (pronounced as “solar”) is a Java written search platform. That means, you will store and search data here. We will discuss it broadly later..
SOLR is standalone full-text search server. While Lucene is the core of SOLR, it is capable of near-real-time (NRT) indexing, highlighting hit, facet search, database integration, dynamic clustering and handling pdf or word file. It is a product of Apache Software Foundation.
Let’s make ourselves clear about some words mentioned above:
Near-Real-Time Indexing (NRT)
Ready to search just after index.
Did you see your search term is bold during searching on Google, that is hit highlighting.
On Amazon, on the left side you can get filter option to get more accurate product. Those generated filter options are faceted search.
For now, this is all I want to share. I will be discussing more later.