MOSS 2007 Search - Excluding Complex URL's : Sharepoint
This is a discussion on MOSS 2007 Search - Excluding Complex URL's within the Sharepoint forums in Microsoft Tools category; Hello, I am attempting to exclude a portion of an external php website that is a content source in a MOSS 2007 portal project I am working. However, I am not having any luck in getting the crawl rules to work. I have tried to add a wildcard (i.e. asterisk) character, but this does not work unless I add it right after the "?", e.g. .../index.php?*, however this excludes too much of the site. Ideally I would like to exclude based on further parameters, e.g. / index.php?=calendar&id=80* Has anyone been able to do this or have any suggestions? I've searched ...
| Sharepoint Microsoft sharepoint portal server development, administration and related discussions |
![]() |
| | LinkBack | Thread Tools |
|
#1
| |||
| |||
| I am attempting to exclude a portion of an external php website that is a content source in a MOSS 2007 portal project I am working. However, I am not having any luck in getting the crawl rules to work. I have tried to add a wildcard (i.e. asterisk) character, but this does not work unless I add it right after the "?", e.g. .../index.php?*, however this excludes too much of the site. Ideally I would like to exclude based on further parameters, e.g. / index.php?=calendar&id=80* Has anyone been able to do this or have any suggestions? I've searched everywhere and have not found a single article or post addressing this scenario (with an actual answer/response). Thanks, Ian |
|
#2
| |||
| |||
| I'm having this exact same problem... trying to exclude a URL like this: http://server.here.com/phpapps/ssg_k....php?print=519 With any of these rules doesn't work: *://*/*print=* *://*/index.php?print=* *://*/index.php*print=* *://*/*index.php?print=* *://*/*index.php*print=* On Nov 26 2007, 2:05 pm, iclark.consult...@gmail.com wrote: > Hello, > > I am attempting to exclude a portion of an external php website that > is a content source in a MOSS 2007 portal project I am working. > However, I am not having any luck in getting the crawl rules to > work. > > I have tried to add a wildcard (i.e. asterisk) character, but this > does not work unless I add it right after the "?", > e.g. .../index.php?*, however this excludes too much of the site. > Ideally I would like to exclude based on further parameters, e.g. / > index.php?=calendar&id=80* > > Has anyone been able to do this or have any suggestions? > > I've searched everywhere and have not found a single article or post > addressing this scenario (with an actual answer/response). > > Thanks, > > Ian |
![]() |
« Previous Thread
|
Next Thread »
| Thread Tools | |
| |
| ||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| MOSS Search: Crawl Rules: Crawler not crawling complex Urls | Application Development | Sharepoint | 5 | 02-01-2010 03:17 AM |
| MOSS 2007 search service hangs sometimes when the search is comple | usenet | Sharepoint | 0 | 11-27-2007 09:48 AM |
| MOSS 2007, Search Box and Search Core Results | usenet | Sharepoint | 3 | 11-13-2007 11:37 PM |
| MOSS 2007 - Search | usenet | Sharepoint | 1 | 06-27-2007 09:59 AM |
| Re: Excluding a path from MOSS 2007 | usenet | Sharepoint | 0 | 06-01-2007 05:07 PM |
All times are GMT -5. The time now is 08:55 AM.


