Find DOE Collections
International Collaboration Speeds
Advances in E-Science
Multilingual WorldWideScience.org: International Collaboration Speeds Advances in E-Science
United Nations Commission
on Science and Technology for Development
25 May 2011
Walter L. Warnick, Ph.D.
Director, Office of Scientific and Technical Information
U.S. Department of Energy
What Is WorldWideScience.org?
• A search engine covering national scientific databases and R&D results from the governments of 71 countries
• A tool that integrates these results and makes them searchable by a single query, then returns them in relevance order
• A virtual collection that is enormous and mostly non-Googleable
• A portal that offers translations between a number of languages
It is up and working now, freely available without registration to anyone with internet access
What Is DOE?
U.S. Department of Energy (DOE) Mission: to ensure America's security and prosperity by addressing its energy, environmental, and nuclear challenges through transformative science and technology solutions.
DOE is the single largest government supporter of basic research in the physical sciences in the United States, providing more than 40 percent of total Federal funding. It oversees, and is the principal Federal funding agency of, the Nation's research programs in high-energy physics, nuclear physics, and fusion energy sciences.
What Is OSTI?
The Office of Scientific and Technical Information (OSTI) is a program within DOE's Office of Science with a corporate responsibility for ensuring access to DOE R&D results.
• We make DOE R&D results findable and accessible, not just within DOE, but globally
• We make other people's R&D results findable, bringing worldwide R&D to DOE and beyond
MISSION: To advance science and sustain technological creativity by making R&D findings available and useful to DOE researchers and the public.
OSTI's Responsibility: DOE Scientific and Technical Information Program
• OSTI coordinates with points of contact across the DOE complex
• DOE R&D results are:
• Collected from DOE offices, nat'l labs, & facilities, as well as university grantees;
• Preserved for re-use; and
• Made accessible via multiple web outlets.
• Interagency and international exchanges/partnerships leverage access and use of DOE R&D results
science.gov 5.0 USA.gov for SCIENCE
WorldWideScience.org - The Global Science Gateway
Department of Energy National Laboratories
• 30,000 scientists and engineers within the DOE enterprise
• Over 25,000 facility users and over 11,000 visiting scientists per year
Our Common Interests
UNCTAD United National Conference on Trade and Development
The Commission on Science and Technology for Development (CSTD)
"the United Nations torch-bearer for science, technology and innovation."
Founded on the principle that "advances in science can be accelerated when the diffusion of science knowledge is accelerated."
Both organizations believe that scientific progress requires global collaboration.
Both organizations are dedicated to:
• Sharing and strengthening global knowledge;
• Bridging the digital divide through information and communications technology; and
• Accelerating progress and development worldwide.
• Governed by international WorldWideScience Alliance
• Chaired by UK
- Richard Boulderstone, British Library
• Under the umbrella of the International Council for Scientific and Technical Information
• Diverse Executive Board Leadership from Canada, South Africa, South Korea, and the U.S.
• Supported financially by 49 countries
• Operated by U.S. Department of Energy/OSTI
• R&D results in sustainable energy, medicine, agriculture, environment and physical sciences
• Provides free searching of open-source collections and portals
• Searches multiple data sources with a single query; ranks results in relevance order
Launched in 2008
Integrates 71 nations
Provides over 400 million pages of science information from databases and portals worldwide
Launched in 2010
To further accelerate access to science, multilingual translations are needed in both directions
• Translation of English content for non-English speakers … and …
• Translation of non-English content for English speakers
• Advances science across language barriers
• Offers real-time translations of globally dispersed scientific literature in 10 languages (Arabic to be added June 2011)
• Benefits the English-speaking science community by providing searching and translation of non-English sources
• Benefits native speakers of other major languages by translating search results in the user's language of choice
Our Shared Vision
world summit on the information society
Geneva 2003 - Tunis 2005
12 December 2003
In keeping with the vision of the World Summit on the Information Society (WSIS), WorldWideScience:
• "strive [s] to promote universal access with equal opportunities for all to scientific knowledge and the creation and dissemination of scientific and technical information."
• is the product of "new forms of solidarity, partnership and cooperation among governments and other stakeholders."
• provides "equitable access to information for economic, social, political, health, cultural, educational and scientific activities."
• "facilitates access to public domain information …"
• "promote [s] the production and accessibility to all content educational, scientific, cultural or recreational in diverse languages and formats."
WorldWideScience.org could be called the quintessential World Summit on the Information Society information and communication technology for E-science.
The "Deep Web"
We Integrate or Aggregate Multiple Government R&D-related Databases into Single-Search Portals
Innovative technology, federated search, drills down to selected databases and websites in parallel, then presents relevance ranked search results
• The "surface web" includes the billions of pages searched by using conventional search engines (e.g. Google, Yahoo! Bing).
• The "deep web" contains huge document repositories not reached by traditional search engines
- more than 500 times the size of the surface web
- contains perhaps 99 percent of web-accessible scientific documents
• allows users to search multiple data sources simultaneously via a single query;
• presents results in ranked order relevant to the search query; and
• places no requirements or burdens on database owners
From U.S. Science Portal
Science.gov, launched in December 2002, pioneered the use of federated search across the U.S. government:
• Includes scientific and technical information from 14 U.S. science agencies representing 97% of the federal R&D budget
• Now offers access to more than 45 databases, 2,000 websites and 200 million pages of science information via a single query
• Served as the model for WorldWideScience.org
• Is the U.S. contribution to WorldWideScience.org
… to Global Science Gateway
June 2006, at the annual conference of the International Council for Scientific and Technical Information (ICSTI), OSTI proposed taking the national model of Science.gov global.
Our concept: "Science.world" would employ federated search and relevancy ranking technologies to provide access to scientific databases and portals around the world
January 2007, the British Library and U.S. Department of Energy signed a Statement of Intent to collaborate on a global science gateway and invited other nations to join the effort.
Statement of Intent for A Global Science Gateway
Closing the digital divide
"The technology exists for federated searching across vast, dispersed science information system."
"…nations have recognized the importance of providing their citizens with one-stop electronic access to increasing volumes of scientific information"
"…growing sense of the need for reciprocity and sharing of science knowledge across national boundaries"
"There are existing science information systems and collections which provide the critical foundation and content for a global, decentralized body of science knowledge
Debut of WorldWideScience.org
June 2007, at the ICSTI conference in Nancy, France, a prototype global science gateway was debuted
• Now called WorldWideScience.org
• Performed federated searching of 12 portals and databases across 10 countries
February 2008, at the ICSTI meeting in Paris, Terms of Reference were adopted for the global science gateway
• ICSTI would be a primary sponsor of the Alliance
• OSTI would serve as the Operating Agent for WorldWideScience.org and secretariat to the Alliance
Launch of the WorldWideScience Alliance
June 12, 2008, The WorldWideScience Alliance was officially launched at the annual ICSTI conference in Seoul
• Provided free real-time search of 32 NATIONAL scientific databases and portals in 44 countries
• Covered 6 continents and nearly half of the world's population
• Offered searchable access to 200 million pages of science content
Organizations representing 38 of the 44 countries agreed to take part in governance and funding
Growth of WorldWideScience.org
WorldWideScience.org has grown at a powerful rate. Today WorldWideScience.org:
• Provides access to content from 71 countries and over 80 national databases and portals
• Covers nearly 80% of the world's population
• Searches an estimated 400 million pages of important scientific portals worldwide
- Well-known sources: the U.S. NIH's PubMed, CERN, KoreaScience
- And more obscure sources: Bangladesh Journals Online
KOREA Science The Gateway to S&T Information in Korea
Bangladesh Journals Online
Uniqueness of WorldWideScience.org
• 33 sample queries launched in Google, Google Scholar, and WorldWideScience.org
• Similar quantities in the numbers of results, but very little overlap.
• Among the "top 50" results from each search engine, only 2.4% overlap or 97.6% uniqueness in WorldWideScience.org results.
WWS - 97.6% "Unique"
Google scholar beta
International public-private Collaboration
• WorldWideScience Alliance
• U.S. DOE/OSTI
• Deep Web Technologies
• Microsoft Research
State of the Search
Coming June 2011
Expanding Multilingual Translations
• Multilingual WorldWideScience.orgBETA integrated into the main WorldWideScience.org site
• Addition of Arabic to translations
• The world's 5th most commonly-spoken language
• One of 6 official UN languages
Coming June 2011
Integrates Multimedia-based Science & Technology
• Multimedia (e.g. video, audio, images) represents a major emerging form of scientific information
• Multimedia presents special opportunitiesand challenges lack of written transcripts, minimal metadata, scientific/technical/medical terminology, lengthy videos (>1 hour)
Coming June 2011
WorldWideScience.org Goes Mobile
Growth in smart phone capabilities, speed, and usage is phenomenal.
• Majority of usage growth emanating from developing countries.
• Mobile phones allow developing countries to "leapfrog" old technologies serving to close the "digital divide."
Compatible with major brands of "smart phones" iPhone, Android, Blackberry.
Provides access to over 80 scientific databases, many of which are not individually optimized for mobile web searching.
Extending Our Partnership
WorldWideScience.org includes R&D findings published by or on behalf of 21 of the current member states of the UN Commission on Science and Technology for development:
Republic of Congo
Extending Our Partnership
Going forward, the WorldWideScience Alliance will welcome opportunities to collaborate further with the UN Commission on Science and Technology for Development
Please help by:
• Bringing WorldWideScience.org to the attention of science-attentive citizens around the world
• Encouraging science-producing nations that are not already participating in WorldWideScience.org to offer their own scientific databases
The Promise of WorldWideScience.org
With each new database that is added to WorldWideScience.org, a new door of opportunity opens
It can be the science component of digital libraries throughout the world
WorldWideScience.org encompasses the vision and principles of the World Summit on the Information Society for science, technology and innovation and our global science gateway has the power to improve lives around the world