| Website Policies and Important Links | E-print Web Log |
|
Technical Requirements From the E-print Network, users can Search e-prints on multiple Web sites and databases, browse e-prints by Disciplines, browse a list of Scientific Societies, and receive Alerts for new e-prints added to the E-print Network. E-prints on Web Sites search – The E-print Network E-prints on Web Sites searches indexed PDF documents residing on e-print Web sites. This search has been optimized to reduce the "noise" of irrelevant items and to minimize duplication. Database search – The deep Web search offers full-text searching within and across database collections. This provides users the capability to pulse the search engines of selected e-print databases with a single query. A compiled search results list is returned. If a description is needed for a database, please refer to the Collection Description page. The results of an E-print Network search may include full-text, bibliographic citations and other types of material. The searches are powered by Distributed Explorit from Deep Web Technologies. As a service to our user community, a list of Scientific Societies and associations which focus on topics related to the Department of Energy's research and development initiatives is provided. Many of these societies provide electronic publications. While access to some publications may be free of charge, others may require subscriptions, pay per view arrangements, or access through libraries or commercial providers. Disciplines contain links to e-print servers or e-prints provided by individual authors, which have been organized into subject-specific categories. The Alert Service will automatically notify you when new e-print information is indexed or added to databases in your specific areas of interest. Simply register for the service and then fill in your search criteria on the "Create an E-print Alert page". This site works best with Microsoft Internet Explorer 4.0 or higher. Viewers and an unzip utility are also recommended for displaying documents. Below is a list of URL's with links to some viewers and utilities which may be downloaded for use with this site.
Although resources may employ different options, the following are generally true for E-print Network searching:
Some collection search engines may not support field searching in some fields such as creator/author, title, and/or date. However, users may enter search terms in these fields and the E-print Network's federated search attempts the search by performing a search on the collection in the most applicable field available. For instance, if a collection does not support searching for an author/creator, the E-print Network's federated search may search that collection's full bibliographic record for the entered author's name. The "And" example: E-prints on Web Sites was selected, with the topic Biology and Medicine; in the Full Record field, the words "bioenergetics and proteomics" (without the quotation marks) were typed in. Date range and number of records to show were at default values. The "Or" example: E-prints on Web Sites DEselected; databases for Astrophysics (arXiv), General Relativity & Quantum Cosmology (arXiv), and Physics (arXiv) were all selected; title field contained the words "cosmological constant or supernova"; date range 2003-2003; default number of items to return per database (10). Output was (result files are deleted periodically): http://eprints.osti.gov/cgi-bin/genresults?qry1088689944 The wildcard example: NECI Research Index and Max-Planck Institute Preprints (MPI) both selected, "turbulen*" typed into Full Record field (without the quotation marks), date range 1990-2003, default number of records to display per database (10). It might be pointed out in any commentary that "Turbulen*" is chosen so that any documents with either of the words "turbulent" or "turbulence" will be found. Output was (result files are deleted periodically): http://eprints.osti.gov/cgi-bin/genresults?qry1699341634
Distributed Explorit and Explorit Focused Crawler Explorit is the core product from which all other current Deep Web Technologies products are derived. Basically, Explorit is a highly-configurable and easily customizable interface to commercially available, off-the-shelf (COTS) full-text search engines from Lextek International and Verity. Explorit provides the capability to deploy small to large-scale collections of information on the web - fully searchable and easily navigable - to a wide range of user communities. Large organizations or information purveyors with many collections of heterogeneous information benefit from the consistency and usability of the Explorit user interface: whether they deploy one collection or one hundred, users quickly learn that all Explorit applications operate essentially the same way, and variances are determined by content rather than inconsistent design. From the home page, click on "Disciplines" to browse e-print sites categorized by discipline or subject areas. Resources may appear under one or more discipline areas.
From the home page, click on "Find Scientific Societies" to browse a list of scientific societies and associations which focus on topics related to the Department of Energy's research and development initiatives. Many of these societies provide electronic publications. While access to some publications may be free of charge, others may require subscriptions, pay per view arrangements, or access through libraries or commercial providers.
Alerts can be set up for any combination of the collections represented on the "Search" page. These collections currently include reports on computer science, mathematics, and physics. To create a User Name and Password, follow these steps:
It is possible to edit and modify your alerts from your personal alert page. (If not already logged on, choose EXISTING PATRON, enter user name and password and click login.) Click on the name of the Alert you wish to edit. You may modify the search terms, select the number of records to retrieve, or add/delete e-print collections. When your changes have been made, click the Submit button, your changes will go into effect the next time the Alerts are processed. To delete an Alert, simply click on the Delete button. This sample alert, “Magnesium Diboride”, is designed to find new e-prints in the condensed-matter database of arXiv.org as well as e-prints on materials-science websites that contain “MgB” at the beginning of any term in the text or bibliographic data (abstract, title, &c.). The term “MgB*” is actually less precise than “MgB2”, but is intended to circumvent the problem of multiple subscript formats such as “MgB_2”, “MgB{sub 2}”, etc. The name given to the alert is of course arbitrary. Initial output for this alert can be seen here. Note that there are more items from some sources than the maximum setting. This is common for the first week's output, as the alert system catches up on items that were added before one's alert began running; in later weeks, the alerts will have no more than the maximum number of items from each source. This “Neutrino Mass” sample alert uses the search phrase “neutrino* AND mass” to find e-prints in the arXiv.org's experimental and phenomenological high-energy physics databases and in the KEK Information System database whose bibliographic records contain both the word “mass” and any word that begins with “neutrino”. The databases were chosen to turn up as many papers as possible on experimental evidence for neutrino mass while limiting the number of purely theoretical papers that don’t provide experimental evidence for or against massive neutrinos. The alert “Quantum Chaos” searches arXiv.org's math-preprint, computer-science, and quantum-physics databases, along with physics websites, for bibliographic records characterized by the character string “quantum chaos”.
|