News & Alerts

  

     IT@JH Home

  

Technical Professionals

  Contact IT@JH

Enterprise Web Searching

Enterprise Services maintains an web search engine which is available to all members of the Johns Hopkins Institutions.

The root page for the search engine is:
http://search.jhmi.edu/ -or- http://search.jhu.edu/

Both addresses are equally valid, but for simplicity's sake, the jhmi.edu address will be used for the remainder of this document.

Members of the Johns Hopkins institutions can use this centralized search service to index their own web sites. Search boxes can be placed on individual web sites which offer the user the opportunity to search the entire index or just the portions of the index pertaining to a particular site. Results pages can also be customized to match the look and feel of an existing web site.

Adding a search box to your page is very simple and requires only a small amount of HTML coding. Several examples are included at the bottom of this document.

Is My Site Indexed?
To see if the search index currently contains entries for your site, go to http://search.jhmi.edu/ and enter the following in the search box:

url:www.mysite.org

Replace www.mysite.org with the URL for your site. The results will reveal how many unique entries the search index contains that match the given URL. For example, if you typed url:www.jhu.edu in the search box, the following results page would indicate there are over 30,000 entries that contain www.jhu.edu in the URL.

Adding Your Site
There are several on-line utilities which allow users to directly interact with the search engine and its indexes. These utilities are located on the search engine help page and are only available from computers physically located on the Johns Hopkins campuses. Here are the direct links to the utilities currently available:

  • Add URL - Use this to add your site to the index. It will only be added if it is a valid Hopkins URL. If the site already exists in the index, the URL status will be displayed. Use a full URL for this utility.
URL Status - Another method of finding out whether a site currently exists in the index. If the site is in the index, then lots of useful information from the index is available. Use a full URL for this utility.
  • View Sites - lists the sites currently in the index.
  • Revisit Site - directs the search engine to reindex the site. Use this option if you've made major changes to your site on the file/directory level and they haven't been reflected in the search index.

The URL for your site must be "allowed" by the search engine or it cannot be added. To check the list of allowed URLs, click here. If you have any questions about indexing, search boxes, or other searching issues, please reference the built-in searching help or email searchadmin@jhmi.edu.

Increasing Relevance Ratings
Using the <title> tag within the <head> section of each document will help the search engine more accurately index your content. Details on using <meta> tags to improve the indexing of your site can be found here: http://search.jhmi.edu/help/meta.html?la=en< /A>

Excluding Your Site
There are two main ways to prevent your site from being indexed: robots.txt files and robots <meta> tags. More information about using robots.txt files and robots <meta> tags to prevent indexing is available here:  http://www.robotstxt.org/wc/exclusion.html

Search Box HTML Examples

This piece of code will create a form which returns results from the entire collection on the new search server. This would usually be labeled as "Search All of Hopkins" or something like that:

<form action="http://search.jhmi.edu/query.html">
<input type="text" size="15" maxlength="35" name="qt">
<input type="submit" value=" Search ">
</form>

This piece of code will create a form which returns results from the entire collection on the new search server, but whose results would be limited to pages which contain "www.myurl.edu" in the URL. "www.myurl.edu" would be altered to represent the URL used by your site. Multiple "qp" input statements can be included to accommodate sites with multiple URLs. In this case, the statements are treated like a boolean "OR". This type of form would usually be labeled as "Search Our Site" or something like that:

Single Site:
<form action="http://search.jhmi.edu/query.html">
<input type="hidden" name="qp" value="url:www.myurl.edu">
<input type="text" size="15" maxlength="35" name="qt">
<input type="submit" value=" Search ">
</form>

Multiple Sites (Method 1):
<form action="http://search.jhmi.edu/query.html">
<input type="hidden" name="qp" value="url:www.myurl.edu">
<input type="hidden" name="qp" value="url:www.myurl2.edu">
<input type="text" size="15" maxlength="35" name="qt">
<input type="submit" value=" Search ">
</form>

Multiple Sites (Method 2):
<form action="http://search.jhmi.edu/query.html">
<input type="hidden" name="qp" value="url:www.myurl.edu, url:www.myurl2.edu">
<input type="text" size="15" maxlength="35" name="qt">
<input type="submit" value=" Search ">
</form>

An example which offers a choice between all of Hopkins and a specific site:

<form action="http://search.jhmi.edu/query.html">
<input type="text" size="20" maxlength="35" name="qt">
<input type="submit" value=" Search ">
<input type="radio" name="qp" value="url:www.myurl.org" checked>My Site
<input type="Radio" name="qp" value="">All of Hopkins
</form>

Web Hosting Home

Akamai Video Streaming

DNS Requests

Useful DNS Facts

Network Time Protocol

JHARS Step by Step Instructions

Content Management System

Hosting Policies

Contract Systems Administration

Up-to-Date Server

Personal Webspace

Web Searching

Untitled Document