Av rating:
Total votes: 19
Total comments: 3


Robert Sheldon
Full-Text Indexing Workbench
02 February 2009

/* Robert Sheldon continues his guide to Full-Text Indexing, by putting down his pen and reaching for SSMS for a practical workbench on Full-Text indexing. There is nothing like trying things out to make ideas click. */

/*

After you set up a full-text index on a table in a SQL Server 2005 or SQL Server 2008 database, you can perform a full-text search on the indexed columns in the table. To perform a full-text search, you can use the CONTAINS predicate or the FREETEXT predicate in your query's WHERE clause.

 

This workbench provides you with examples of how to use these predicates to perform a full-text search. (Note that SQL Server also supports two full-text functions, CONTAINSTABLE and FREETEXT table, but this workbench focuses only on the predicates.)

 

When you include the CONTAINS or FREETEXT predicate in your WHERE clause, the query engine searches the columns that are specified in the predicate arguments. These columns must be included in the full-text index that is defined on the specified table. The predicates also let you make use ofthe thesaurus that is available for any of the supported languages.

 

If you're new to full-text indexes and searches, you should first review the Simple-Talk article "Understanding Full-Text Indexing in SQL Server," published December 29, 2008. The article describes how full-text indexes are implemented in SQL Server 2005 and 2008, and provides examples of how to create those indexes.

*/

 

/*

To run the examples in this workbench, you should first set up the necessary environment to test the full-text queries. The following T-SQL statements create the StormyWeather table, populate the table, create the ftcStormyWeather full-text catalog, and then create a full text index on the table. The index is added to the ftcStormyWeather catalog.

*/

 

-- Create the StormyWeather table.

USE AdventureWorks2008 --replace with correct DB name

GO

IF OBJECT_ID (N'StormyWeather', N'U') IS NOT NULL

DROP TABLE StormyWeather

GO

CREATE TABLE StormyWeather (

  StormID INT NOT NULL IDENTITY,

  StormHead NVARCHAR(50) NOT NULL,

  StormBody NVARCHAR(MAX) NOT NULL,

  CONSTRAINT [PK_StormyWeather_StormID] PRIMARY KEY CLUSTERED (StormID ASC)

)

GO

-- Populate the StormyWeather table with data

-- that supports various full-text query types.

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Stormy Weather Delays Travel',

  'The stormy weather made travel by motor vehicle difficult.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Stormier Weather on Monday',

  'The stormier weather on Monday made vehicle travel difficult.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Stormiest Weather in December',

  'December can be the stormiest month, making automobile travel difficult.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Storm Grows Strong',

  'The storm is growing strong.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Storms Crossing the Pacific',

  'The storms are lining up across the Pacific Ocean.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Storm''s Wind Delays Travel',

  'The storm''s wind made car travel difficult on Tuesday.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Storms'' Flooding Delays Travel',

  'The storms'' flooding made auto travel difficult throughout December.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Children Run from Room',

  'The children often storm out of the room when upset.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Boy Runs from Room',

  'The boy storms out of the room when his sister changes the channel.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Girl Ran from Room',

  'The girl stormed out of the room when her brother ate the cookie.')

INSERT INTO StormyWeather (StormHead, StormBody)

VALUES('Children Running from Room',

  'The children were storming out of the room when the lights went out.')

GO

-- Create a full-text catalog and set it as the default.

CREATE FULLTEXT CATALOG ftcStormyWeather

AS DEFAULT

GO

-- Create a full-text index on the StormyWeather table.

-- Add the index to the ftcStormyWeather catalog.

CREATE FULLTEXT INDEX ON StormyWeather(StormHead, StormBody)

KEY INDEX PK_StormyWeather_StormID

ON ftcStormyWeather

GO

 

/*

Use the CONTAINS predicate to search the columns included in the full-text index. The CONTAINS arguments must be enclosed in parentheses. Multiple columns must be separated by a comma and enclosed in parentheses. The search condition (the term or terms) must be enclosed in single quotes.

*/

 

-- Search a single column for a single term.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormHead, 'storm')

 

/*

The statement returns the following results:

4  Storm Grows Strong            The storm is growing strong.

6  Storm's Wind Delays Travel    The storm's wind made car travel difficult on Tuesday.

*/

 

-- Search multiple columns for a single term.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS((StormHead, StormBody), 'storm')

 

/*

The statement returns the following results:

4  Storm Grows Strong            The storm is growing strong.

6  Storm's Wind Delays Travel    The storm's wind made car travel difficult on Tuesday.

8  Children Run from Room        The children often storm out of the room when upset.

 

The next statement returns the same results.

*/

 

-- Use an asterisk (*) wildcard instead of column names

-- to search all full-text columns.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(*, 'storm')

 

-- When searching multiple terms, use a comparative

-- operator, such as OR or AND, to separate the terms.

-- Individual terms should be enclosed in double quotes.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormHead, '"storm" OR "storms" OR "stormy" OR "stormier" OR "stormiest"')

 

/*

The statement returns the following results:

1  Stormy Weather Delays Travel   The stormy weather made travel by motor vehicle difficult.

2  Stormier Weather on Monday     The stormier weather on Monday made vehicle travel difficult.

3  Stormiest Weather in December  December can be the stormiest month, making automobile travel difficult.

4  Storm Grows Strong             The storm is growing strong.

5  Storms Crossing the Pacific    The storms are lining up across the Pacific Ocean.

6  Storm's Wind Delays Travel     The storm's wind made car travel difficult on Tuesday.

7  Storms' Flooding Delays Travel The storms' flooding made auto travel difficult throughout December.

*/

 

/*

Rather than specify multiple terms, you can use a 'prefix term' if the terms begin with the same characters.

To use a prefix term, specify the beginning characters, then add an asterisk (*) wildcard to the end of the term. Enclose the prefix term in double quotes.

The following statement returns the same results as the previous one.

*/

 

-- Search for all terms that begin with 'storm'

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormHead, '"storm*"')

 

/*

Not all related terms can be effectively consolidated into a prefix term. For example, 'run' and 'ran' would require "r*", which would match all words beginning with 'r'. In these cases, you can specify each inflection of the word, such as '"run" OR "ran" OR "runs"', or you can use FORMSOF in your CONTAINS predicate.

*/

 

-- Specify each inflection.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormHead, '"run" OR "runs" OR "ran" OR "running"')

 

/*

The statement returns the following results:

8  Children Run from Room     The children often storm out of the room when upset.

9  Boy Runs from Room         The boy storms out of the room when his sister changes the channel.

10 Girl Ran from Room         The girl stormed out of the room when her brother ate the cookie.

11 Children Running from Room The children were storming out of the room when the lights went out.

 

The following statement returns the same results as the previous one.

*/

 

-- Use the FORMSOF and INFLECTIONAL keywords, along with the root word.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormHead, 'FORMSOF(INFLECTIONAL, run)')

 

/*

As you can see in the previous examples, the CONTAINS predicate returns an exact match, unless you specify a prefix term or FORMSOF. However, these methods will not work for different words with similar meanings, such as 'car' and 'automobile'. In these cases, you can use a thesaurus to match these types of terms.

 

For example, you can update the tsenu.xml thesaurus file (for LCID 1033) by adding the following elements:

 

    <thesaurus xmlns="x-schema:tsSchema.xml">

      <diacritics_sensitive>0</diacritics_sensitive>

        <expansion>

            <sub>car</sub>

            <sub>auto</sub>

            <sub>automobile</sub>

            <sub>vehicle</sub>

            <sub>motor vehicle</sub>

        </expansion>

    </thesaurus>

   

When you add these elements to your thesaurus file, the full-text search engine can then treat these terms the same. For example, if you search on 'car', 'auto' and 'automobile' will also be included in your search.

 

After you update a thesaurus file, you might need to reload it. In SQL Server 2008, you can use the sp_fulltext_load_thesaurus_file system stored procedure to reload the thesaurus file after you've updated it. In SQL Server 2005, you must restart the full-text search service.

 

Now when you search on car and specify the FORMSOF and THESAURUS keywords, your search will return all rows that include any of the terms specified in the thesaurus file.

*/

 

-- Use the term as is to return an exact match.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormBody, 'car')

 

/*

The statement returns the following results:

6  Storm's Wind Delays Travel     The storm's wind made car travel difficult on Tuesday.

*/

 

-- Use the FORMSOF and THESAURUS keywords to use the thesaurus files.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormBody, 'FORMSOF(THESAURUS, car)')

 

/*

The statement returns the following results:

1  Stormy Weather Delays Travel   The stormy weather made travel by motor vehicle difficult.

2  Stormier Weather on Monday     The stormier weather on Monday made vehicle travel difficult.

3  Stormiest Weather in December  December can be the stormiest month, making automobile travel difficult.

6  Storm's Wind Delays Travel     The storm's wind made car travel difficult on Tuesday.

7  Storms' Flooding Delays Travel The storms' flooding made auto travel difficult throughout December.

*/

 

/*

You can also use the NEAR keyword between terms in your search condition to specify words or phrases that must be near to each other.

*/

 

-- Use near to return rows in which 'travel' is near forms of 'storm'.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormBody, '"storm*" NEAR travel')

 

/*

The statement returns the following results:

1  Stormy Weather Delays Travel   The stormy weather made travel by motor vehicle difficult.

2  Stormier Weather on Monday     The stormier weather on Monday made vehicle travel difficult.

3  Stormiest Weather in December  December can be the stormiest month, making automobile travel difficult.

6  Storm's Wind Delays Travel     The storm's wind made car travel difficult on Tuesday.

7  Storms' Flooding Delays Travel The storms' flooding made auto travel difficult throughout December.

*/

 

-- You can use NEAR to chain together multiple terms.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormBody, '"storm*" NEAR travel NEAR Tuesday')

 

/*

The statement returns the following results:

6  Storm's Wind Delays Travel    The storm's wind made car travel difficult on Tuesday.

*/

 

/*

The full-text queries in the examples here are relatively straightforward. However, these queries can get quite complicated. Fortunately, the CONTAINS  predicate supports a number of methods that let you simplify your queries (such as the prefix term). In addition, you can use the FREETEXT predicate to simplify your queries even more. FREETEXT treats each word in a phrase as a separate term and automatically finds different inflections for that term and applies the appropriate thesaurus files.

*/

 

-- Define each form of a term in your search condition.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormBody, '"weather" OR "flood" 

  OR "flooding" OR "car" OR "auto" OR "automobile" OR "vehicle" OR "motor vehicle"')

 

/*

The statement returns the following results:

1  Stormy Weather Delays Travel   The stormy weather made travel by motor vehicle difficult.

2  Stormier Weather on Monday     The stormier weather on Monday made vehicle travel difficult.

3  Stormiest Weather in December  December can be the stormiest month, making automobile travel difficult.

6  Storm's Wind Delays Travel     The storm's wind made car travel difficult on Tuesday.

7  Storms' Flooding Delays Travel The storms' flooding made auto travel difficult throughout December.

 

The following two statements return the same results as the previous one.

*/

 

-- When possible, use prefix terms, thesaurus files, or other devices

-- to simplify your full-text queries.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE CONTAINS(StormBody, '"weather" OR "flood*" OR FORMSOF(THESAURUS, car)')

 

-- You can also use FREETEXT when applicable to search for terms.

SELECT StormID, StormHead, StormBody FROM StormyWeather

WHERE FREETEXT(StormBody, 'weather flood car')

 

/*

That's all there is to using the CONTAINS and FREETEXT predicates. Keep in mind that CONTAINS is more precise than FREETEXT. And, of course, a full-text search can be much more complex than shown in the examples here. Be sure to check out SQL Server Books Online for more details about both of these predicates.

*/

 



This article has been viewed 4269 times.
Robert Sheldon

Author profile: Robert Sheldon

After being dropped 35 feet from a helicopter and spending the next year recovering, Robert Sheldon left the Colorado Rockies and emergency rescue work to pursue safer and less painful interests—thus his entry into the world of technology. He is now a technical consultant and the author of numerous books, articles, and training material related to Microsoft Windows, various relational database management systems, and business intelligence design and implementation. He has also written news stories, feature articles, restaurant reviews, legal summaries, and the novel 'Dancing the River Lightly'. You can find more information at http://www.rhsheldon.com.

Search for other articles by Robert Sheldon

Rate this article:   Avg rating: from a total of 19 votes.


Poor

OK

Good

Great

Must read
 
Have Your Say
Do you have an opinion on this article? Then add your comment below:
You must be logged in to post to this forum

Click here to log in.


Subject: hi
Posted by: raymon2683 (view profile)
Posted on: Tuesday, February 03, 2009 at 5:51 AM
Message: hi, i really like your article upon full texts , i learned a lot.
But i had a class where i was supposed to implement full text search on a table on sql server 2000.
I made sure that the search service was installed during the sql server instance creation(QTR7).

When i tried to enable the full text catalog :

use Northwind
exec sp_fulltext_database 'enable'

i get the following error:
Msg 7608 .An unknown full-text Failure(80004002)
occured in function EnumCatalogs on full-text catalog.

Subject: Hi
Posted by: raymon2683 (view profile)
Posted on: Tuesday, February 03, 2009 at 5:55 AM
Message: Hi,i really appreciate the article and i learned a lot.
But here i am , i was handling a beginner class on full text search on sql 2000 server.
When i try to enable the full text database i get the following error:

Msg 7608, level 17, State 1, Procedure sp_fulltext_database, Line 77.
An unknown full-text Failure(80004002) occurred in function EnumCatalogs on full-text catalog.

i tried to check the account privileges but all was right .
Can you help.

Subject: Full text indexing
Posted by: Vinit gaur (not signed in)
Posted on: Thursday, February 05, 2009 at 10:57 PM
Message: This is very nice and helpful for all.

 










Phil Factor
Finding Stuff in SQL Server Database DDL
 You'd have thought that nothing would be easier than using SQL Server Management Studio (SSMS) for searching... Read more...



 View the blog
Implementing User-Defined Hierarchies in SQL Server Analysis Services
 To be able to drill into multidimensional cube data at several levels, you must implement all of the... Read more...

Using the Filtering API with the SQL Comparison SDK
 Red Gate's SQL Comparison SDK provides a means to compare and synchronize database schemas and data... Read more...

SQL Toolbelt 2008: Predominantly an Engineering Task
 The conversion of the Red-Gate tools to be compatible with SQL Server 2008 might not seem, on first... Read more...

SQL Response: The dim sum interview
 Richard Morris met David and Nigel of the SQL Response team, in a dim sum Restaurant in Cambridge. They... Read more...

Writing Efficient SQL: Set-Based Speed Phreakery
 Phil Factor's SQL Speed Phreak challenge is an event where coders battle to produce the fastest code to... Read more...

Beginning SQL Server 2005 Reporting Services Part 1
 Steve Joubert begins an in-depth tour of SQL Server 2005 Reporting Services with a step-by-step guide... Read more...

Ten Common Database Design Mistakes
 Database design and implementation is the cornerstone of any data centric project (read 99.9% of... Read more...

Beginning SQL Server 2005 Reporting Services Part 2
 Continuing his in-depth tour of SQL Server 2005 Reporting Services, Steve Joubert demonstrates the most... Read more...

SQL Server Full Text Search Language Features
 SQL Full-text Search (SQL FTS) is an optional component of SQL Server 7 and later, which allows fast... Read more...

Creating CSV Files Using BCP and Stored Procedures
 Nigel Rivett demonstrates some core techniques for extracting SQL Server data into CSV files, focussing... Read more...

Over 150,000 Microsoft professionals subscribe to the Simple-Talk technical journal. Join today, it's fast, simple, free and secure.

Join Simple Talk