We having a really hard time to figure out two similar strings given to sys.dm_fts_parser gives different results
select * from sys.dm_fts_parser('"0 CAD"', 0, null, 0)
seems to think that "0 CAD" is one token (returns 2 token)
select * from sys.dm_fts_parser('"0 cad"', 0, null, 0)
returns 3 tokens - correctly
more importantly and even more confusing is why
select * from Table where contains(*,"point 5 CAD")
works and
select * from Table where contains(*,"point 5 cad")
fails
where the column searched contains "point 5 CAD" -
Shouldn't the full text index builder either ignore noise words (e.g. "5") based upon the index setting or include it.
We have tried both and cant explain why "nnnn CAD" is something special
note that full text is suppose to be case-insensitive according to http://msdn.microsoft.com/en-us/library/ms142583.aspx
What am I missing?
Edit: Using SQL 2012 11.0.2218
When using SQL 2008
On SQL 2012 (11.0.3218):
In SQL 2012 Microsoft introduced a new word breaker (version 14.0.4763.1000) http://msdn.microsoft.com/en-us/library/gg509108.aspx
It seems that the work-breaker now recognizes 3 character ISO 4217 Currency Codes, and if there is a number prior to the 3 char code it is not broken up.