U.S. Department of Energy Office of Science Office of Scientific and Technical Information

Vantage Point: Government R&D Results Distributor to Accelerate Science Discovery

Slide1

Vantage Point: Government R&D Results Distributor to Accelerate Science Discovery

Sharon M. Jordan
Assistant Director for Program Integration
U.S. DOE Office of Scientific & Technical Information

Transparency
Colaboration
Participation

Global Data Access

Interagency Data Access
DOE Data Access

U.S. Department of Energy

Office of Science

osti.gov

Slide2

OSTI Mission


Volume of Content Made Searchable by OSTI

WorldWideScience.org:
400,000,000 pages of Global Scientific and Technical Information (STI) These web-available pages would fill 62,000 traditional 2-feet deep file drawers.

Science.gov:
200,000,000 pages of U.S. Government STI These web-available pages would fill 33,000 traditional 2-feet deep file drawers.

STIP Collection:
11,400,000 pages of U.S. Department of Energy STI These web-available pages would fill 1,900 traditional 2-feet deep file drawers.

Amount of Data Transferred in FY08: 9.95 terabytes

To advance science and sustain technological creativity by making R&D findings available and useful to Department of Energy (DOE) researchers and the public

  • Making government-sponsored R&D information available via single-query aggregated portals
  • Responding to practices of searchers.*

Librarians, researchers and the public can access a science page count comparable to, but not duplicative of, the entire science content of Google™

*In 2002 OCLC ?Perceptions of Library and Information Resources,? 84% of public began search using search engines; only 1% began with online databases.

Slide3

Free Aggregator of Multiple Government R&D-related Databases

Drills down to selected databases and websites in parallel, then presents ranked search results

Science Accelerator
Integrates 10 key DOE databases that cover a range of R&D results (reports, patents, citations, project summaries, eprints, etc.)

Science.gov
Integrates databases and websites of 14 U.S. science agencies

WorldWideScience.org
Integrates science information databases and portals from >60 nations

Slide4

Science.gov

Integrated Search Portal: Single Query of Science.gov Databases and Websites

Integrates simple search interface with database functionality & searches in real time:

  • over 40 databases of 14 U.S. science agencies
  • 200 million pages of science information
  • Thousands of authoritative scientific websites within .gov domain/sponsorship

Science.gov helps users identify databases for more advanced searching as needed

Science.gov results are uniquely different from Google and Google Scholar results 95.8% of the time

  • less than 1% overlap with Google
  • approximately 3.2% overlap with Google Scholar

Key Features

  • Clusters results
  • EurekAlert! Science News
  • Wikipedia links
  • Download capabilities
  • Relevancy ranked results
  • Alerts service
  • Search within a search
  • Email search results
  • Spell check
  • Results sorting
  • User-friendly navigation, preference options
Slide5

WorldWideScience.org

  • Searches 61 science databases and portals sponsored by governments and national institutions in 61 countries
  • Covers scientific literature from over three-fourths of the world?s population
  • Includes a vast quantity of science (over 400 million pages), providing significant journal coverage as well as a majority of grey literature.
  • Current research in multi-lingual translation technologies will enable non-English databases to be searched from the WWS interface
  • Proving WWS ?deep web? value, recent analysis shows only 3.5% overlap with Google and Google Scholar
Slide6

science.gov
Real Time Search
Relevancy Ranked
All Govt. Science
Known Sources
Scholarly Info

WorldWideScience.org
Real Time Search?
Relevancy Ranked
All Govt. Science
Known Sources
Scholarly Info

Google Scholar
Relevancy Ranked?
Scholarly Info
Ads?

Google
Relevancy Ranked?
Ads?

Slide7

Science.gov and WWS Discovery Tools vs. Discipline-Oriented Databases

Science.gov Govt Portal Search

  • One-stop access to multiple databases targeted for inclusion
  • Integrates and organizes huge amount of information
  • Provides well-rounded picture of topic to quickly scan environment
  • Offers user the ability to drill down to individual database(s)
  • Simple searching, yet powerful results: one search box or advanced search specifying fields or databases
  • Modules enhance user experience with quantity of information, e.g., clustering, filtering, sorting, menu selections

Discipline-Oriented Databases

  • Generally have more human input, preparation of data prior to posting
  • Have more administrative control (not reliant on underlying databases)
  • Presentation of information uses standard fields, standard look and feel, standard query search strings often makes for more comfortable user experience (especially at first)
  • Depth of information can be achieved due to discipline-specific nature

Conclusion: They are complementary and suitable for certain audiences/tasks/circumstances