Full-text search

« Back to Glossary Index

Full-text search and indexing
– Full-text search is divided into indexing and searching when dealing with a large number of documents or substantial search queries.
– The indexing stage scans the text of all documents and builds a list of search terms (index).
– Stop words, common and meaningless words, are ignored during indexing.
– Language-specific stemming is used to record words with similar concepts under a single index entry.

Precision vs. recall tradeoff
– Recall measures the quantity of relevant results returned by a search, while precision measures the quality of the results.
– Low-precision, low-recall search results in a small number of relevant results returned.
– Full-text search systems use options like stop words and stemming to increase precision and recall.
– Controlled-vocabulary searching helps eliminate ambiguities and improve precision.
– There is a trade-off between precision and recall: increasing precision may lower recall and vice versa.

False-positive problem
– Full-text searching often retrieves irrelevant documents, called false positives.
– False positives are caused by the inherent ambiguity of natural language.
– Clustering techniques based on Bayesian algorithms can reduce false positives.
– Clustering categorizes documents based on relevant words, improving search results.
– This technique is extensively used in the e-discovery domain.

Performance improvements and improved querying tools
– Full text searching deficiencies are addressed by providing users with improved querying tools.
– Keywords improve recall by including synonyms of words that describe the subject.
– Field-restricted search limits searches to a specific field within a data record.
– Boolean queries using operators like AND, NOT, and OR increase precision.
– Phrase search matches documents containing a specified phrase.
– Concept search matches multi-word concepts, such as compound term processing.
– Concordance search produces an alphabetical list of principal words with their context.
– Proximity search matches documents with words separated by a specified number of words.
– Regular expression employs a complex querying syntax for precise retrieval conditions.
– Fuzzy search looks for documents that match given terms with some variation around them.

Software and references
– Thunderstone Software LLC
– Vespa
– Vivísimo
– [Other software products for full-text indexing and searching]
– In practice, it may be difficult to determine how a given search engine works.
– The search algorithms employed by web-search services are seldom fully disclosed.
– Capabilities of Full Text Search System (Archived from the original on December 23, 2010)
– Coles, Michael (2008). Pro Full-Text Search in SQL Server 2008 (Version 1ed.). Apress Publishing Company. ISBN978-1-4302-1594-3.
– B., Yuwono; Lee, D. L. (1996). Search and ranking algorithms for locating resources on the World Wide Web. 12th International Conference on Data Engineering (ICDE96). p.164.

Full-text search (Wikipedia)

In text retrieval, full-text search refers to techniques for searching a single computer-stored document or a collection in a full-text database. Full-text search is distinguished from searches based on metadata or on parts of the original texts represented in databases (such as titles, abstracts, selected sections, or bibliographical references).

In a full-text search, a search engine examines all of the words in every stored document as it tries to match search criteria (for example, text specified by a user). Full-text-searching techniques appeared in the 1960s, for example IBM STAIRS from 1969, and became common in online bibliographic databases in the 1990s.[verification needed] Many websites and application programs (such as word processing software) provide full-text-search capabilities. Some web search engines, such as the former AltaVista, employ full-text-search techniques, while others index only a portion of the web pages examined by their indexing systems.

« Back to Glossary Index

Submit your RFP

We can't wait to read about your project. Use the form below to submit your RFP!

Gabrielle Buff
Gabrielle Buff

Just left us a 5 star review

Great customer service and was able to walk us through the various options available to us in a way that made sense. Would definitely recommend!

Stoute Web Solutions has been a valuable resource for our business. Their attention to detail, expertise, and willingness to help at a moment's notice make them an essential support system for us.

Paul and the team are very professional, courteous, and efficient. They always respond immediately even to my minute concerns. Also, their SEO consultation is superb. These are good people!

Paul Stoute & his team are top notch! You will not find a more honest, hard working group whose focus is the success of your business. If you’re ready to work with the best to create the best for your business, go Stoute Web Solutions; you’ll definitely be glad you did!

Wonderful people that understand our needs and make it happen!

Paul is the absolute best! Always there with solutions in high pressure situations. A steady hand; always there when needed; I would recommend Paul to anyone!

facebook
Vince Fogliani
recommends

The team over at Stoute web solutions set my business up with a fantastic new website, could not be happier

facebook
Steve Sacre
recommends

If You are looking for Website design & creativity look no further. Paul & his team are the epitome of excellence.Don't take my word just refer to my website "stevestours.net"that Stoute Web Solutions created.This should convince anyone that You have finally found Your perfect fit

facebook
Jamie Hill
recommends

Paul and the team at Stoute Web are amazing. They are super fast to answer questions. Super easy to work with, and knows their stuff. 10,000 stars.

Paul and the team from Stoute Web solutions are awesome to work with. They're super intuitive on what best suits your needs and the end product is even better. We will be using them exclusively for our web design and hosting.

facebook
Dean Eardley
recommends

Beautifully functional websites from professional, knowledgeable team.

Along with hosting most of my url's Paul's business has helped me with website development, graphic design and even a really cool back end database app! I highly recommend him as your 360 solution to making your business more visible in today's social media driven marketplace.

I hate dealing with domain/site hosts. After terrible service for over a decade from Dreamhost, I was desperate to find a new one. I was lucky enough to win...

Paul Stoute has been extremely helpful in helping me choose the best package to suite my needs. Any time I had a technical issue he was there to help me through it. Superb customer service at a great value. I would recommend his services to anyone that wants a hassle free and quality experience for their website needs.

Paul is the BEST! I am a current customer and happy to say he has never let me down. Always responds quickly and if he cant fix the issue right away, if available, he provides you a temporary work around while researching the correct fix! Thanks for being an honest and great company!!

Paul Stoute is absolutely wonderful. Paul always responds to my calls and emails right away. He is truly the backbone of my business. From my fantastic website to popping right up on Google when people search for me and designing my business cards, Paul has been there every step of the way. I would recommend this company to anyone.

I can't say enough great things about Green Tie Hosting. Paul was wonderful in helping me get my website up and running quickly. I have stayed with Green...