Indexing service and hyphens - Inetserver
This is a discussion on Indexing service and hyphens - Inetserver ; I am trying to search for a word such as "e-business" using the Indexing
Service Query object (CissoQuery). Now what I would like to do is to be able
to search for e-bus and return results of variations of this ...
-
Indexing service and hyphens
I am trying to search for a word such as "e-business" using the Indexing
Service Query object (CissoQuery). Now what I would like to do is to be able
to search for e-bus and return results of variations of this term, e.g.
e-business, e-busi. So effectively, I would like to a do a wildcard search.
Unfortunately, when I search for this term, it returns to me documents that
do not have e-business in them but variations of e (I have modified the noise
list to remove noise words) and business as well as ebusiness. I don't want
this to happen. I can search for the phrase "e-business" and it returns the
correct results back. However if I search for "e-bus" it returns no results
back because it is looking for the entire phrase. If I search for e-business
without the quotes, I get the variations of which I talked about earlier for
documents that don't contain that phrase. How do I configure Indexing
Service to return me results with hyphens back. I have yet to find any
answer on the web anywhere where this question has been asked sufficiently.
If this is a bug and cannot be done in indexing service, please tell me and I
will stop attempting to try and figure this out. I am aware that this is a
general indexing service question but I know sql server uses the service
internally or something like it, so I am posting this question to this
newsgroup.
-
Re: Indexing service and hyphens
unfortunately Indexing Services considers the hyphen to be a token delimiter
and as such will consider e-business two words or tokens, e and business.
When you wildcard the wildcarding is done on each token in your search
phrase, so both e and bus will be wildcarded.
Your best bet is to replicate the - with a literal HYPEN in your content and
your search phrase. Then when displaying results to your users you will have
to fix the displayed results to display correctly.
--
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Hammad" <Hammad@discussions.microsoft.com> wrote in message
news:44C05CE8-5C99-4509-A17B-3CEE9F57963F@microsoft.com...
> I am trying to search for a word such as "e-business" using the Indexing
> Service Query object (CissoQuery). Now what I would like to do is to be
able
> to search for e-bus and return results of variations of this term, e.g.
> e-business, e-busi. So effectively, I would like to a do a wildcard
search.
> Unfortunately, when I search for this term, it returns to me documents
that
> do not have e-business in them but variations of e (I have modified the
noise
> list to remove noise words) and business as well as ebusiness. I don't
want
> this to happen. I can search for the phrase "e-business" and it returns
the
> correct results back. However if I search for "e-bus" it returns no
results
> back because it is looking for the entire phrase. If I search for
e-business
> without the quotes, I get the variations of which I talked about earlier
for
> documents that don't contain that phrase. How do I configure Indexing
> Service to return me results with hyphens back. I have yet to find any
> answer on the web anywhere where this question has been asked
sufficiently.
> If this is a bug and cannot be done in indexing service, please tell me
and I
> will stop attempting to try and figure this out. I am aware that this is
a
> general indexing service question but I know sql server uses the service
> internally or something like it, so I am posting this question to this
> newsgroup.
-
Re: Indexing service and hyphens
Hi Hilary,
Thanks for the response.
I don't quite understand what you mean by the following:
" Your best bet is to replicate the - with a literal HYPEN in your content and
your search phrase. Then when displaying results to your users you will have
to fix the displayed results to display correctly."
Could you explain in more detail?
Thanks,
Hammad
"Hilary Cotter" wrote:
> unfortunately Indexing Services considers the hyphen to be a token delimiter
> and as such will consider e-business two words or tokens, e and business.
>
> When you wildcard the wildcarding is done on each token in your search
> phrase, so both e and bus will be wildcarded.
>
> Your best bet is to replicate the - with a literal HYPEN in your content and
> your search phrase. Then when displaying results to your users you will have
> to fix the displayed results to display correctly.
>
> --
> Hilary Cotter
> Looking for a SQL Server replication book?
> http://www.nwsu.com/0974973602.html
>
> Looking for a FAQ on Indexing Services/SQL FTS
> http://www.indexserverfaq.com
>
> "Hammad" <Hammad@discussions.microsoft.com> wrote in message
> news:44C05CE8-5C99-4509-A17B-3CEE9F57963F@microsoft.com...
> > I am trying to search for a word such as "e-business" using the Indexing
> > Service Query object (CissoQuery). Now what I would like to do is to be
> able
> > to search for e-bus and return results of variations of this term, e.g.
> > e-business, e-busi. So effectively, I would like to a do a wildcard
> search.
> > Unfortunately, when I search for this term, it returns to me documents
> that
> > do not have e-business in them but variations of e (I have modified the
> noise
> > list to remove noise words) and business as well as ebusiness. I don't
> want
> > this to happen. I can search for the phrase "e-business" and it returns
> the
> > correct results back. However if I search for "e-bus" it returns no
> results
> > back because it is looking for the entire phrase. If I search for
> e-business
> > without the quotes, I get the variations of which I talked about earlier
> for
> > documents that don't contain that phrase. How do I configure Indexing
> > Service to return me results with hyphens back. I have yet to find any
> > answer on the web anywhere where this question has been asked
> sufficiently.
> > If this is a bug and cannot be done in indexing service, please tell me
> and I
> > will stop attempting to try and figure this out. I am aware that this is
> a
> > general indexing service question but I know sql server uses the service
> > internally or something like it, so I am posting this question to this
> > newsgroup.
>
>
>
-
Re: Indexing service and hyphens
what I mean is that you will have to go through your content and every time
you see e-business you will have to replace it with eHYPHENbusiness. The
when you search you will have to trap for searching on e-business and
replace it with a search on eHYPHENbusiness.
--
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Hammad" <Hammad@discussions.microsoft.com> wrote in message
news:336F2981-DCF1-45C0-9B35-E1139BF92517@microsoft.com...
> Hi Hilary,
>
> Thanks for the response.
>
> I don't quite understand what you mean by the following:
>
> " Your best bet is to replicate the - with a literal HYPEN in your content
and
> your search phrase. Then when displaying results to your users you will
have
> to fix the displayed results to display correctly."
>
> Could you explain in more detail?
>
> Thanks,
>
> Hammad
>
>
> "Hilary Cotter" wrote:
>
> > unfortunately Indexing Services considers the hyphen to be a token
delimiter
> > and as such will consider e-business two words or tokens, e and
business.
> >
> > When you wildcard the wildcarding is done on each token in your search
> > phrase, so both e and bus will be wildcarded.
> >
> > Your best bet is to replicate the - with a literal HYPEN in your content
and
> > your search phrase. Then when displaying results to your users you will
have
> > to fix the displayed results to display correctly.
> >
> > --
> > Hilary Cotter
> > Looking for a SQL Server replication book?
> > http://www.nwsu.com/0974973602.html
> >
> > Looking for a FAQ on Indexing Services/SQL FTS
> > http://www.indexserverfaq.com
> >
> > "Hammad" <Hammad@discussions.microsoft.com> wrote in message
> > news:44C05CE8-5C99-4509-A17B-3CEE9F57963F@microsoft.com...
> > > I am trying to search for a word such as "e-business" using the
Indexing
> > > Service Query object (CissoQuery). Now what I would like to do is to
be
> > able
> > > to search for e-bus and return results of variations of this term,
e.g.
> > > e-business, e-busi. So effectively, I would like to a do a wildcard
> > search.
> > > Unfortunately, when I search for this term, it returns to me documents
> > that
> > > do not have e-business in them but variations of e (I have modified
the
> > noise
> > > list to remove noise words) and business as well as ebusiness. I
don't
> > want
> > > this to happen. I can search for the phrase "e-business" and it
returns
> > the
> > > correct results back. However if I search for "e-bus" it returns no
> > results
> > > back because it is looking for the entire phrase. If I search for
> > e-business
> > > without the quotes, I get the variations of which I talked about
earlier
> > for
> > > documents that don't contain that phrase. How do I configure Indexing
> > > Service to return me results with hyphens back. I have yet to find
any
> > > answer on the web anywhere where this question has been asked
> > sufficiently.
> > > If this is a bug and cannot be done in indexing service, please tell
me
> > and I
> > > will stop attempting to try and figure this out. I am aware that this
is
> > a
> > > general indexing service question but I know sql server uses the
service
> > > internally or something like it, so I am posting this question to this
> > > newsgroup.
> >
> >
> >
Similar Threads
-
By Application Development in forum Inetserver
Replies: 0
Last Post: 12-01-2005, 09:35 AM
-
By Application Development in forum Inetserver
Replies: 0
Last Post: 09-04-2005, 12:37 PM
-
By Application Development in forum Inetserver
Replies: 4
Last Post: 07-19-2005, 05:36 AM
-
By Application Development in forum Inetserver
Replies: 2
Last Post: 07-19-2004, 10:45 AM
-
By Application Development in forum Inetserver
Replies: 1
Last Post: 04-14-2004, 10:00 AM