Sharon Jordan
STI Meeting
March 20, 2002
Interagency project with 10 agencies
- Designated “FirstGov for Science” portal
- OSTI hosting domain
- Deep Web search function by OSTI, funded by FirstGov cross portal
- Currently available in test mode
- Launch being planned for May 2002
Web is the tool of choice
- Each science agency offers information to fulfill its mission
- Science is not bounded by organization or geography
- Interagency collaboration needed
- Audience: science-attentive citizen
- Gateway to agencies’ science information
- Multiple user paths via Web
- Voluntary collaboration - no significant burden
- Not a replacement for any agency’s activities
- Inclusive of all agencies’ science information
- Phased development
- Over 100 DOE URLs included
- S&T content based
- Information freely available
- Unrestricted (no passwords or logins)
- Categorized by topic areas
- “Audience channels” possible
- Historically, STI efforts have concentrated on technical reports
- R&D results are also found in other types of literature
- Now and in the future, the focus is on improving comprehensiveness and awareness of all forms of R&D results
- Open source literature (articles and citations) issued by journal publishers that generally provide peer review of the articles prior to publication; includes publications issued by commercial publishers or professional societies.
- Issued by the author; typically any work posted on the Internet by the author at his/her home institution or at a preprint repository; includes papers considered in final form or those circulated for comment via the Internet; also referred to as "e-prints."
- Issued by noncommercial sources, e.g., government agencies, research labs/facilities, and educational institutions; typically unavailable through commercial publishers; includes technical reports, conference papers, dissertations, and other materials published by the author's sponsor or employer.
- OAI framework based on metadata harvesting.
- Two roles:
- Data providers (e.g., laboratories) support a simple harvesting protocol to provide extracts of local metadata in a common, minimal-level format in response to requests from service providers.
- Service providers (e.g., OSTI) use extracted metadata to provide user-oriented services, such as web portals, to materials that are distributed across multiple sources.
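The two roles above can be sketched in a few lines of Python. This is a minimal illustration, not OSTI's implementation: the endpoint `BASE_URL` and the sample record are hypothetical, and the XML namespaces are those of OAI-PMH version 2.0, which is an assumption about the protocol version in use. The service provider builds a harvest request, and the data provider's response is reduced to a few Dublin Core fields.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical data-provider endpoint (assumption, not from the slides).
BASE_URL = "https://repository.example.org/oai"

# Namespaces as defined by OAI-PMH 2.0 and Dublin Core (assumed version).
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

def build_harvest_url(base_url, since=None):
    """Service-provider side: request records in the minimal oai_dc format."""
    params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    if since:
        params["from"] = since  # harvest only records changed since this date
    return base_url + "?" + urlencode(params)

# Made-up response of the kind a data provider might return.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:report-1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample technical report</dc:title>
          <dc:creator>Doe, J.</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

def parse_records(xml_text):
    """Extract identifier, title, and creators from each harvested record."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": rec.findtext("oai:header/oai:identifier", namespaces=NS),
            "title": rec.findtext(".//dc:title", namespaces=NS),
            "creators": [e.text for e in rec.iterfind(".//dc:creator", NS)],
        })
    return records

harvest_url = build_harvest_url(BASE_URL, since="2002-03-01")
harvested = parse_records(SAMPLE_RESPONSE)
```

In a real harvest the service provider would fetch `harvest_url` over HTTP and pass the response body to `parse_records`; the sample string stands in for that response so the sketch runs offline.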
- Dublin Core-based fields and qualifiers adopted in OSTI processing & tools
- STI 241.1 form revision:
- To map fields more closely to DC
- To further streamline process
- To update terms and definitions
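As a rough illustration of what mapping form fields to Dublin Core could look like, the sketch below translates a submission record into DC elements. The form field names (`report_title`, `authors`, and so on) are hypothetical placeholders, not the actual STI 241.1 fields; only the element names on the right-hand side (`title`, `creator`, `date`, `identifier`, `subject`) come from the Dublin Core element set.

```python
# Hypothetical form field names mapped to Dublin Core element names.
# The left-hand keys are illustrative assumptions only.
FIELD_TO_DC = {
    "report_title": "title",
    "authors": "creator",
    "publication_date": "date",
    "report_number": "identifier",
    "subject_terms": "subject",
}

def to_dublin_core(form_record):
    """Translate a form record into a dict of DC element -> list of values."""
    dc = {}
    for field, value in form_record.items():
        element = FIELD_TO_DC.get(field)
        if element is None:
            continue  # field has no Dublin Core equivalent in this mapping
        values = value if isinstance(value, list) else [value]
        dc.setdefault(element, []).extend(values)
    return dc

example = to_dublin_core({
    "report_title": "Sample technical report",
    "authors": ["Doe, J.", "Roe, R."],
    "internal_note": "not mapped",
})
```

Keeping the mapping in a single table means a form revision only needs the table updated, which fits the stated goal of streamlining the process.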
- First Dublin Core-based tool
- 1948 to present --> AEC, ERDA, DOE
- 2 million records (public); 4 million (DOE)
- Links to electronic full text, when available
- Includes citations to various types of STI
- Growth through regular updates
- Launched Jan. 2000; has grown to >8,400 sites and ½ million papers
- Covers range of eprints in disciplines of interest to DOE
- Cross-site search capability of researcher-posted papers
- Preprint Alerts
- Tool under development
- Web-accessible controlled access file
- To include various access limitations: ECI, OUO, CRADA, SBIR, etc.
- Addresses a current, growing need
- Based on guidance received to date:
- DOE memos (Blake)
- OpSec list of sensitive subjects
- Statutory protections
- Programmatic judgment
- Interim practices vs. mandated requirements
- Complete second review at OSTI
- Sites validate initial review
- Reconcile records
- Assess impact of possible new protection category
- Await further legal, programmatic and administrative guidance