U.S. Department of Energy Office of Science Office of Scientific and Technical Information

WorldWideScience.org Multilingual Search of Chemistry and Other Sciences ACS National Meeting

Slide01

Slide01

WorldWideScience.org

Multilingual Search of Chemistry and Other Sciences

 

ACS National Meeting

Fall 2012

 

Brian A. Hitson, Associate Director

Office of Scientific & Technical Information

U.S. Department of Energy

WorldWideScience Alliance

Slide02

Slide02

What Is OSTI?

OSTI is a program within DOE’s Office of Science, with a corporate responsibility for ensuring access to DOE R&D results.

Since 1947!

  • Public access to unclassified, unlimited
  • Restricted access to classified and sensitive

 

Energy Policy Act of 2005

"The Secretary, through the Office of Scientific and Technical Information, shall maintain within the Department publicly available collections of scientific and technical information resulting from research, development, demonstration, and commercial applications activities supported by the Department."

 

"…the Department's role as a source of information…is unique and indispensable in the advancement of energy technologies."* -- from Quadrennial Technology Review press release

"Our success should be measured not when a project is completed or an experiment concluded, but when scientific and technical information is disseminated…" -- from 2011 Department of Energy Strategic Plan

Slide03

Slide03

DOE-Affiliated Articles by Publisher (2007-2012)

Elsevier 21%

American Chemical Society 19%

American Physical Society 18%

American Institute of Physics 8%

Institute of Physics 7%

Wiley 6%

Springer 4%

Source: Web of Science

Slide04

Slide04

OSTI Products

For specific document or media types

Information Bridge

DOE Data Explorer

ScienceCinema

E-Print Network

DOepatents

ESTSC

DOE Green Energy

DOE R&D Accomplishments

Science Conference Proceedings

Energy Citation Database

 

Aggregator Productsfederated search

Science Accelerator - Integrates key DOE databases Covers a range of R&D results (reports, patents, citations, e-prints, etc.)

Science.gov - Integrates 12 U.S. federal science agencies Databases and websites offer over 200 million pages of science information

WorldWideScience.org - Integrates >70 nations Provides over 400 million pages of science information from databases and portals worldwide; performs multilingual search across 10 languages; translation of English content for non-English speakers and non-English content for English speakers

Slide05

Slide05

WorldWideScience.org

The Global Science Gateway

Slide06

Slide06

History and Collaboration

  • WorldWideScience.org concept emanated from Science.gov model (2006)
  • Initial partnership between U.S. Department of Energy and the British Library (2007)
  • Transition to multilateral governance (WorldWideScience Alliance) and ICSTI* sponsorship (2008)

*International Council for Scientific and Technical Information

Slide07

Slide07

WorldWideScience.org searches the 'deep web'

  • Where science is hundreds of times larger than the “surface web”
  • Generally not searchable by major search engines

Deep Web

Slide08

Slide08

WorldWideScience.org

The Global Science Gateway

"Databases Participating in WorldWideScience.org"

Slide09

Slide09

SciELO Brazil

Slide10

Slide10

Cornell University Library

Slide11

Slide11

Directory of Open Access Journals

Slide12

Slide12

KoreaScience

Slide14

Slide14

Multilingual Translations

The world’s first "one to many" and "many to one" multilingual translations tool in science.

  • Most automatic translations are limited to translating from a single language into another single language.
  • WorldWideScience.org partnering with Microsoft® Translator enables true multilingual functionality.

Slide15

Slide15

Multilingual Translations

Translating ten languages, with potential for more

Arabic Arabic
Chinese Chinese
German Deutsch
English  
Spanish Español
French Français
Japanese Japanese
Korean Korean
Portuguese Português
Russian Russian

Slide17

Slide17

Access to Multimedia-based Science & Technology



A Case Study for Enhanced Multimedia Search & Retrieval

ScienceCinema

http://www.osti.gov/sciencecinema/

• Partnership between OSTI and Microsoft Research.
• Launched in February 2011; searches ~1,800 multimedia files.
• Utilizes Microsoft Research Audio Video Indexing System (MAVIS).
• Enables searching of digitized spoken content.
• Users can search for precise term within video and be directed to the exact point in the video where the term was spoken.

Slide19

Slide19

What's next for WorldWideScience.org?


• Federated search of "big data"
    • BioGRID
    • DataCite
    • DNA Data Bank of Japan
    • DRYAD
    • EMBL-EBI European Bioinformatics Institute
    • ICSU World Data System

Slide20

Slide20

WorldWideScience Alliance



osti.gov

Contact WWS.org Operating Agent:
Brian Hitson, hitsonb@osti.gov
Lorrie Johnson, johnsonl@osti.gov

Translations
powered by Microsoft®
Translator

Microsoft®
Research

ICSTI