Enterprise Web Searching Enterprise Services maintains an web search engine which is available to all members of the Johns Hopkins Institutions. The root page for the search engine is: http://search.jhmi.edu/ -or- http://search.jhu.edu/ Both addresses are equally valid, but for simplicity's sake, the jhmi.edu address will be used for the remainder of this document. Members of the Johns Hopkins institutions can use this centralized search service to index their own web sites. Search boxes can be placed on individual web sites which offer the user the opportunity to search the entire index or just the portions of the index pertaining to a particular site. Results pages can also be customized to match the look and feel of an existing web site. Adding a search box to your page is very simple and requires only a small amount of HTML coding. Several examples are included at the bottom of this document. Is My Site Indexed? To see if the search index currently contains entries for your site, go to http://search.jhmi.edu/ and enter the following in the search box: url:www.mysite.org Replace www.mysite.org with the URL for your site. The results will reveal how many unique entries the search index contains that match the given URL. For example, if you typed url:www.jhu.edu in the search box, the following results page would indicate there are over 30,000 entries that contain www.jhu.edu in the URL. Adding Your Site There are several on-line utilities which allow users to directly interact with the search engine and its indexes. These utilities are located on the search engine help page and are only available from computers physically located on the Johns Hopkins campuses. Here are the direct links to the utilities currently available: - Add URL - Use this to add your site to the index. It will only be added if it is a valid Hopkins URL. If the site already exists in the index, the URL status will be displayed. Use a full URL for this utility.
URL Status - Another method of finding out whether a site currently exists in the index. If the site is in the index, then lots of useful information from the index is available. Use a full URL for this utility. - View Sites - lists the sites currently in the index.
- Revisit Site - directs the search engine to reindex the site. Use this option if you've made major changes to your site on the file/directory level and they haven't been reflected in the search index.
The URL for your site must be "allowed" by the search engine or it cannot be added. To check the list of allowed URLs, click here. If you have any questions about indexing, search boxes, or other searching issues, please reference the built-in searching help or email searchadmin@jhmi.edu. Increasing Relevance Ratings Using the <title> tag within the <head> section of each document will help the search engine more accurately index your content. Details on using <meta> tags to improve the indexing of your site can be found here: http://search.jhmi.edu/help/meta.html?la=en< /A> Excluding Your Site There are two main ways to prevent your site from being indexed: robots.txt files and robots <meta> tags. More information about using robots.txt files and robots <meta> tags to prevent indexing is available here: http://www.robotstxt.org/wc/exclusion.html |