Enormous STI Content Made Easily Searchable by OSTI

by Dr. Walt Warnick on Thu, Jul 1, 2010

We have integrated about ten OSTI products dealing with technical reports, e-prints, patents, conference proceedings, project summaries, etc., so that they are all searchable via s single query.  The integrated product allows users to search without first having to decide which OSTI product is likely to have the content he/she seeks.  This product is ScienceAccelerator.gov.

We have integrated comparable offerings from about 14 other agencies so that all the virtually combined offerings can be searched via a single query.  Science.gov allows users to search without first having to decide which agency offers which content.  The DOE contribution to Science.gov is ScienceAccelerator.gov .

We have integrated comparable offerings from about 70 other countries so that all the offerings can be searched via a single query.  The US contribution to WorldWideScience.org is Science.gov.  WorldWideScience.org allows users to search without first having to decide which country offers which content.  The virtual collection is enormous, being comparable in size to science made searchable via Google.  Our tests suggest, however, that well over 90% of the content of WorldWideScience is non-Googelable.

Until June 11, 2010, the content accessible via WorldWideScience had English titles and other bibliographic information.  On June 11, 2010 WorldWideScience became multilingual.  A beta application was launched which enables speakers of English to search databases posted on behalf of the Russian government for speakers of Russian.  Similarly, for Chinese and seven other languages.  And speakers of these other languages can search the English offerings of WorldWideScience.  The translation capabilities are provided by a collaboration with Microsoft.

Microsoft has posted a blog about Multilingual WWS by Tony Hey, their VP for External Research and a member of our ASCR advisory committee.  http://blogs.msdn.com/b/msr_er/archive/2010/06/16/microsoft-research-and-worldwidescience-org-collaborate-to-break-down-language-barriers.aspx

An indication of the impact of WorldWideScience is that Microsoft has stepped forward to host the 2011 winter meeting of the International Council on Scientific and Technical Information at Microsoft headquarters in Washington state.  This is a major proactive gesture by Microsoft, as hosting one of these meetings is a significant undertaking.  Among the functions at the meeting will be a public session of presentations by invited speakers.  It is safe to predict that this meeting will be well attended.

At this meeting, we plan to launch in beta version the OSTI product that makes the audio portion of DOE STI videos full text searchable.  This product also involves a close collaboration between OSTI and Microsoft.

Then, at the 2011 ICSTI summer meeting in Beijing, we plan to incorporate this new product into WorldWideScience so that it is searched by every WWS query.

Walt Warnick

Other Related Topics: federated search


About the Author

Dr. Walt Warnick's picture
Dr. Walt Warnick
Former Director, U.S. DOE Office of Scientific and Technical Information