Indexing service and hyphens - Inetserver

This is a discussion on Indexing service and hyphens - Inetserver ; I am trying to search for a word such as "e-business" using the Indexing Service Query object (CissoQuery). Now what I would like to do is to be able to search for e-bus and return results of variations of this ...

+ Reply to Thread
Results 1 to 4 of 4

Indexing service and hyphens

  1. Default Indexing service and hyphens

    I am trying to search for a word such as "e-business" using the Indexing
    Service Query object (CissoQuery). Now what I would like to do is to be able
    to search for e-bus and return results of variations of this term, e.g.
    e-business, e-busi. So effectively, I would like to a do a wildcard search.
    Unfortunately, when I search for this term, it returns to me documents that
    do not have e-business in them but variations of e (I have modified the noise
    list to remove noise words) and business as well as ebusiness. I don't want
    this to happen. I can search for the phrase "e-business" and it returns the
    correct results back. However if I search for "e-bus" it returns no results
    back because it is looking for the entire phrase. If I search for e-business
    without the quotes, I get the variations of which I talked about earlier for
    documents that don't contain that phrase. How do I configure Indexing
    Service to return me results with hyphens back. I have yet to find any
    answer on the web anywhere where this question has been asked sufficiently.
    If this is a bug and cannot be done in indexing service, please tell me and I
    will stop attempting to try and figure this out. I am aware that this is a
    general indexing service question but I know sql server uses the service
    internally or something like it, so I am posting this question to this
    newsgroup.

  2. Default Re: Indexing service and hyphens

    unfortunately Indexing Services considers the hyphen to be a token delimiter
    and as such will consider e-business two words or tokens, e and business.

    When you wildcard the wildcarding is done on each token in your search
    phrase, so both e and bus will be wildcarded.

    Your best bet is to replicate the - with a literal HYPEN in your content and
    your search phrase. Then when displaying results to your users you will have
    to fix the displayed results to display correctly.

    --
    Hilary Cotter
    Looking for a SQL Server replication book?
    http://www.nwsu.com/0974973602.html

    Looking for a FAQ on Indexing Services/SQL FTS
    http://www.indexserverfaq.com

    "Hammad" <Hammad@discussions.microsoft.com> wrote in message
    news:44C05CE8-5C99-4509-A17B-3CEE9F57963F@microsoft.com...
    > I am trying to search for a word such as "e-business" using the Indexing
    > Service Query object (CissoQuery). Now what I would like to do is to be

    able
    > to search for e-bus and return results of variations of this term, e.g.
    > e-business, e-busi. So effectively, I would like to a do a wildcard

    search.
    > Unfortunately, when I search for this term, it returns to me documents

    that
    > do not have e-business in them but variations of e (I have modified the

    noise
    > list to remove noise words) and business as well as ebusiness. I don't

    want
    > this to happen. I can search for the phrase "e-business" and it returns

    the
    > correct results back. However if I search for "e-bus" it returns no

    results
    > back because it is looking for the entire phrase. If I search for

    e-business
    > without the quotes, I get the variations of which I talked about earlier

    for
    > documents that don't contain that phrase. How do I configure Indexing
    > Service to return me results with hyphens back. I have yet to find any
    > answer on the web anywhere where this question has been asked

    sufficiently.
    > If this is a bug and cannot be done in indexing service, please tell me

    and I
    > will stop attempting to try and figure this out. I am aware that this is

    a
    > general indexing service question but I know sql server uses the service
    > internally or something like it, so I am posting this question to this
    > newsgroup.




  3. Default Re: Indexing service and hyphens

    Hi Hilary,

    Thanks for the response.

    I don't quite understand what you mean by the following:

    " Your best bet is to replicate the - with a literal HYPEN in your content and
    your search phrase. Then when displaying results to your users you will have
    to fix the displayed results to display correctly."

    Could you explain in more detail?

    Thanks,

    Hammad


    "Hilary Cotter" wrote:

    > unfortunately Indexing Services considers the hyphen to be a token delimiter
    > and as such will consider e-business two words or tokens, e and business.
    >
    > When you wildcard the wildcarding is done on each token in your search
    > phrase, so both e and bus will be wildcarded.
    >
    > Your best bet is to replicate the - with a literal HYPEN in your content and
    > your search phrase. Then when displaying results to your users you will have
    > to fix the displayed results to display correctly.
    >
    > --
    > Hilary Cotter
    > Looking for a SQL Server replication book?
    > http://www.nwsu.com/0974973602.html
    >
    > Looking for a FAQ on Indexing Services/SQL FTS
    > http://www.indexserverfaq.com
    >
    > "Hammad" <Hammad@discussions.microsoft.com> wrote in message
    > news:44C05CE8-5C99-4509-A17B-3CEE9F57963F@microsoft.com...
    > > I am trying to search for a word such as "e-business" using the Indexing
    > > Service Query object (CissoQuery). Now what I would like to do is to be

    > able
    > > to search for e-bus and return results of variations of this term, e.g.
    > > e-business, e-busi. So effectively, I would like to a do a wildcard

    > search.
    > > Unfortunately, when I search for this term, it returns to me documents

    > that
    > > do not have e-business in them but variations of e (I have modified the

    > noise
    > > list to remove noise words) and business as well as ebusiness. I don't

    > want
    > > this to happen. I can search for the phrase "e-business" and it returns

    > the
    > > correct results back. However if I search for "e-bus" it returns no

    > results
    > > back because it is looking for the entire phrase. If I search for

    > e-business
    > > without the quotes, I get the variations of which I talked about earlier

    > for
    > > documents that don't contain that phrase. How do I configure Indexing
    > > Service to return me results with hyphens back. I have yet to find any
    > > answer on the web anywhere where this question has been asked

    > sufficiently.
    > > If this is a bug and cannot be done in indexing service, please tell me

    > and I
    > > will stop attempting to try and figure this out. I am aware that this is

    > a
    > > general indexing service question but I know sql server uses the service
    > > internally or something like it, so I am posting this question to this
    > > newsgroup.

    >
    >
    >


  4. Default Re: Indexing service and hyphens

    what I mean is that you will have to go through your content and every time
    you see e-business you will have to replace it with eHYPHENbusiness. The
    when you search you will have to trap for searching on e-business and
    replace it with a search on eHYPHENbusiness.

    --
    Hilary Cotter
    Looking for a SQL Server replication book?
    http://www.nwsu.com/0974973602.html

    Looking for a FAQ on Indexing Services/SQL FTS
    http://www.indexserverfaq.com

    "Hammad" <Hammad@discussions.microsoft.com> wrote in message
    news:336F2981-DCF1-45C0-9B35-E1139BF92517@microsoft.com...
    > Hi Hilary,
    >
    > Thanks for the response.
    >
    > I don't quite understand what you mean by the following:
    >
    > " Your best bet is to replicate the - with a literal HYPEN in your content

    and
    > your search phrase. Then when displaying results to your users you will

    have
    > to fix the displayed results to display correctly."
    >
    > Could you explain in more detail?
    >
    > Thanks,
    >
    > Hammad
    >
    >
    > "Hilary Cotter" wrote:
    >
    > > unfortunately Indexing Services considers the hyphen to be a token

    delimiter
    > > and as such will consider e-business two words or tokens, e and

    business.
    > >
    > > When you wildcard the wildcarding is done on each token in your search
    > > phrase, so both e and bus will be wildcarded.
    > >
    > > Your best bet is to replicate the - with a literal HYPEN in your content

    and
    > > your search phrase. Then when displaying results to your users you will

    have
    > > to fix the displayed results to display correctly.
    > >
    > > --
    > > Hilary Cotter
    > > Looking for a SQL Server replication book?
    > > http://www.nwsu.com/0974973602.html
    > >
    > > Looking for a FAQ on Indexing Services/SQL FTS
    > > http://www.indexserverfaq.com
    > >
    > > "Hammad" <Hammad@discussions.microsoft.com> wrote in message
    > > news:44C05CE8-5C99-4509-A17B-3CEE9F57963F@microsoft.com...
    > > > I am trying to search for a word such as "e-business" using the

    Indexing
    > > > Service Query object (CissoQuery). Now what I would like to do is to

    be
    > > able
    > > > to search for e-bus and return results of variations of this term,

    e.g.
    > > > e-business, e-busi. So effectively, I would like to a do a wildcard

    > > search.
    > > > Unfortunately, when I search for this term, it returns to me documents

    > > that
    > > > do not have e-business in them but variations of e (I have modified

    the
    > > noise
    > > > list to remove noise words) and business as well as ebusiness. I

    don't
    > > want
    > > > this to happen. I can search for the phrase "e-business" and it

    returns
    > > the
    > > > correct results back. However if I search for "e-bus" it returns no

    > > results
    > > > back because it is looking for the entire phrase. If I search for

    > > e-business
    > > > without the quotes, I get the variations of which I talked about

    earlier
    > > for
    > > > documents that don't contain that phrase. How do I configure Indexing
    > > > Service to return me results with hyphens back. I have yet to find

    any
    > > > answer on the web anywhere where this question has been asked

    > > sufficiently.
    > > > If this is a bug and cannot be done in indexing service, please tell

    me
    > > and I
    > > > will stop attempting to try and figure this out. I am aware that this

    is
    > > a
    > > > general indexing service question but I know sql server uses the

    service
    > > > internally or something like it, so I am posting this question to this
    > > > newsgroup.

    > >
    > >
    > >




+ Reply to Thread

Similar Threads

  1. OCR/indexing TIF images using Indexing service
    By Application Development in forum Inetserver
    Replies: 0
    Last Post: 12-01-2005, 09:35 AM
  2. Indexing Service
    By Application Development in forum Inetserver
    Replies: 0
    Last Post: 09-04-2005, 12:37 PM
  3. Indexing Service Not Indexing All Files On Windows 2003 Server Stnd
    By Application Development in forum Inetserver
    Replies: 4
    Last Post: 07-19-2005, 05:36 AM
  4. indexing php pages w/ indexing service (or index server) on W2K
    By Application Development in forum Inetserver
    Replies: 2
    Last Post: 07-19-2004, 10:45 AM
  5. Help with Indexing Service
    By Application Development in forum Inetserver
    Replies: 1
    Last Post: 04-14-2004, 10:00 AM