To learn about crawlers and Content Crawler Web Services, click here.
To specify the destination folder and security for documents imported by this crawler:
Specify into which folders you want to import content. The crawler attempts to import a link to every document it finds into the most subordinate subfolder within the destination folder that allows the link to pass. Click here for a flow chart showing how the crawler determines into which folders it will import content.
To add destination folders, click Add Folder; then, in the Choose Folders dialog box, select the folders you want to add and click OK. To crawl documents into a folder, you must have at least Edit access to that folder.
To remove a folder, select the folder and click the remove icon.
To select or clear all of the folder check boxes, select or clear the box to the left of Folder Path.
To toggle the order in which the folders are sorted, click Folder Path. The icon to the right of Folder Path signifies the current alphabetical sort order: ascending or descending.
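The placement rule for destination folders (a link goes into the most subordinate subfolder that allows it to pass) can be pictured as a recursive descent through the folder tree. The following Python sketch is only an illustrative model, not the portal's implementation: the `Folder` class, the `accepts` predicate standing in for a folder's acceptance test, and the `place` function are all hypothetical names. It also assumes that a document may land in more than one subfolder when several of them accept it.

```python
from dataclasses import dataclass, field
from typing import Callable, List


# Hypothetical model of a portal folder; not a portal API.
@dataclass
class Folder:
    path: str
    # Stand-in for whatever test the folder applies to an incoming link.
    accepts: Callable[[dict], bool] = lambda doc: True
    subfolders: List["Folder"] = field(default_factory=list)


def place(doc: dict, folder: Folder) -> List[str]:
    """Return the paths of the deepest folders under `folder` that accept `doc`."""
    targets: List[str] = []
    for sub in folder.subfolders:
        if sub.accepts(doc):
            targets.extend(place(doc, sub))
    # If no subfolder accepts the document, it stays in the current folder.
    return targets or [folder.path]


# Hypothetical destination folder with two levels of subfolders.
root = Folder("/Docs", subfolders=[
    Folder("/Docs/Reports", accepts=lambda d: d.get("type") == "report", subfolders=[
        Folder("/Docs/Reports/Annual", accepts=lambda d: d.get("kind") == "annual"),
    ]),
    Folder("/Docs/Memos", accepts=lambda d: d.get("type") == "memo"),
])

print(place({"type": "report", "kind": "annual"}, root))  # ['/Docs/Reports/Annual']
print(place({"type": "report"}, root))                    # ['/Docs/Reports']
print(place({"type": "invoice"}, root))                   # ['/Docs']
```

In this model a document that no subfolder accepts remains in the destination folder itself, which matches the default behavior in which the destination folder's own filter is not applied.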
If the Content Crawler Web Service used by this crawler supports folder mirroring (specified on the Advanced Settings page of the Content Crawler Web Service Editor), you can have this crawler create Knowledge Directory folders that duplicate the folder structure of the content repository being crawled by selecting Mirror the source folder structure.
Notes:
If you mirror the folder structure and import security information with each document (described in Step 5), the folder security is imported for the mirrored folders.
If you mirror the folder structure, upon successive runs the crawler removes any portal folders that do not have corresponding source folders. For this reason, if you run this crawler periodically, neither you nor anyone else should modify the mirrored portal folders or documents in any way.
You cannot change the mirror setting after creation of this crawler. That is, if you set this crawler to mirror the folder structure, you cannot edit this setting later.
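To see why manual changes to mirrored folders do not survive, it can help to think of each run as a comparison between two sets of folder paths. The sketch below is a simplified model under that assumption; `plan_mirror_sync` and the example paths are hypothetical and do not correspond to any portal API.

```python
from typing import Iterable, Set, Tuple


def plan_mirror_sync(
    source_folders: Iterable[str],
    portal_folders: Iterable[str],
) -> Tuple[Set[str], Set[str]]:
    """Return (folders_to_create, folders_to_remove) for one crawler run."""
    source = set(source_folders)
    portal = set(portal_folders)
    to_create = source - portal  # in the repository, not yet in the portal
    to_remove = portal - source  # in the portal, no corresponding source folder
    return to_create, to_remove


# Hypothetical example: someone added "Reports/Drafts" by hand in the portal.
source = ["Reports", "Reports/Annual", "Memos"]
portal = ["Reports", "Reports/Annual", "Reports/Drafts"]

to_create, to_remove = plan_mirror_sync(source, portal)
print(sorted(to_create))  # ['Memos']
print(sorted(to_remove))  # ['Reports/Drafts']
```

The hand-made `Reports/Drafts` folder has no counterpart in the source, so under this model the next run would remove it.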
To require that documents pass the filters of destination folders before the documents are imported into those folders, select Apply Filter of Destination folder. By default, documents do not need to pass the filters of destination folders, so all documents will be imported into all destination folders.
To accept all imported documents into the portal and make them immediately available to users, select Automatically approve imported documents. By default, documents require approval. This means that before the link to the imported document is available to users, it must be approved by a portal administrator with at least Edit access to the destination folder.
If you are mirroring the folder structure, you might want to set imported documents to be approved automatically and restrict users to Read access (users in the Administrators group always have Admin access). If you set imported documents to require approval, be aware that any portal administrator who has at least Edit access can also modify the folders and content, and can therefore put your portal folders and content out of sync with your source repository.
If the Content Crawler Web Service used by this crawler supports security importation and the source repository users and groups correspond to portal users and groups (specified in the Global ACL Sync Map), you can have this crawler import security information with each document by selecting Import security with each document. This automatically makes documents that are available to source repository users available to the mapped portal users.
Note: Only read access is imported; write access is ignored. Read access is equivalent in the source repository and the portal, but write access is not: write access to a document in an external repository allows you to edit the document, whereas write access in the portal (referred to as Edit access) allows you to edit the properties and security settings of that document.
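The effect of importing security can be modeled as a lookup through the sync map that keeps only read entries. The following sketch is an assumption-laden illustration rather than the portal's actual logic: the function `import_read_access`, the `(principal, access)` tuple format, and the dictionary used for the map are all hypothetical, and it assumes that source principals without a mapping are simply skipped.

```python
from typing import Dict, List, Tuple


def import_read_access(
    source_acl: List[Tuple[str, str]],
    sync_map: Dict[str, str],
) -> Dict[str, str]:
    """Map source ACL entries to portal privileges, keeping read access only."""
    portal_acl: Dict[str, str] = {}
    for principal, access in source_acl:
        if access != "read":
            continue  # write access is ignored
        portal_principal = sync_map.get(principal)
        if portal_principal is not None:  # assumption: unmapped principals are skipped
            portal_acl[portal_principal] = "Read"
    return portal_acl


# Hypothetical source ACL and sync map.
source_acl = [
    ("REPO\\editors", "read"),
    ("REPO\\editors", "write"),
    ("REPO\\guests", "read"),
    ("REPO\\unmapped", "read"),
]
sync_map = {"REPO\\editors": "Portal Editors", "REPO\\guests": "Portal Guests"}

print(import_read_access(source_acl, sync_map))
```

In this example both mapped groups end up with Read access in the portal, and the source write entry for the editors group has no effect.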
Under Document Access Privileges, you can perform the following actions to manually grant users and groups access to the content imported by this crawler:
To add users or groups, click Add Users/Groups; then, in the Choose Groups and Users dialog box, select the users and groups you want to add and click OK. To add a user or group, you must have at least Select access to that user or group.
For each user or group, in the associated Privilege drop-down list, choose the access privilege you want to grant for content imported by this crawler.
To remove a user or group, select the box next to the user or group and click the remove icon.
To select or clear all of the user and group check boxes, select or clear the box to the left of Users/Groups.
To toggle the order in which the users and groups are sorted, click Users/Groups. The icon to the right of Users/Groups signifies the current alphabetical sort order: ascending or descending.
To view the members of a group, click the group name.
To display the page associated with this help topic: