Data repositories

This list is part of the Open Access Directory.

  • This is a list of repositories and databases for open data.
  • Please annotate the entries to indicate the hosting organization, scope, licensing, and usage restrictions (if any). If a repository is open in some respects but not others, please include it with an annotation rather than exclude it.
  • If you're not sure whether a given dataset or data collection is open, post your query to Is It Open Data?
  • Related lists in OAD: Disciplinary repositories (primarily for texts, not data).

Archaeology

  • Also see Social sciences.

Astronomy

  • Also see Physics.

Biology

  • Also see BCO-DMO, Marine Biology data, listed with Marine Sciences repositories.
  • Also see DataONE, Entrez databases, KNB, and PANGAEA, listed under Multidisciplinary repositories.
  • The Cell: An Image Library. A freely accessible, easy-to-search public repository of reviewed and annotated images, videos, and animations of cells of all types from a variety of organisms, showcasing cell architecture, intracellular structures and functions, and both normal and abnormal processes. The project relies on the cell biology community to populate the library. Its purpose is to advance research, education, and training, with the ultimate goal of improving human health.
  • Gene Expression Omnibus. High-throughput functional genomic data, including all array-based applications and some high-throughput sequencing data.
  • National Biological Information Infrastructure (NBII). A broad, collaborative program to provide increased access to data and information on the nation's biological resources. The NBII links diverse, high-quality biological databases, information products, and analytical tools maintained by NBII partners and other contributors in government agencies, academic institutions, non-government organizations, and private industry.
  • Planet. A network of European plant databases.
  • The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. The UniProt databases are the UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters (UniRef), and the UniProt Archive (UniParc). The UniProt Metagenomic and Environmental Sequences (UniMES) database is a repository specifically developed for metagenomic and environmental data.

Chemistry

  • Also see BCO-DMO, Marine Biology data, listed with Marine Sciences repositories.
  • Also see Entrez databases, listed under Multidisciplinary repositories.
  • Cambridge Structural Database. From the Cambridge Crystallographic Data Centre (CCDC), a non-profit, charitable institution whose objectives are the general advancement and promotion of the science of chemistry and crystallography for the public benefit.
  • ChemSpider. Hosted by the Royal Society of Chemistry.
  • ChemSynthesis. A database of chemicals and their physical properties.
  • eCrystals. From the Southampton Chemical Crystallography Group and the EPSRC UK National Crystallography Service.

Computer Science

  • CiteSeerX provides its databases of nearly 2 million documents, together with the associated texts and PDFs, for research.
  • GitHub keeps your public and private code available, secure, and backed up.
  • Google Code Project Hosting. Provides a free collaborative development environment for open source projects. Each project comes with its own member controls, Subversion/Mercurial repository, issue tracker, wiki pages, and downloads section. The service describes itself as simple, fast, reliable, and scalable, so that developers can focus on their own open source development.
  • Launchpad can host your project’s source code using the Bazaar version control system. We also import over 2000 CVS, SVN, Git and Mercurial projects, so you can use Bazaar with those too.
  • SourceForge. 2.7 million developers create software in over 260,000 projects. Its directory connects more than 46 million consumers with these open source projects and serves more than 2,000,000 downloads a day.

Energy

Environmental sciences

  • Also see BCO-DMO, Marine Biology data, listed with Marine Sciences repositories.
  • Also see DataONE, KNB, and PANGAEA, listed under Multidisciplinary repositories.
  • The Marine Geoscience Data System (MGDS). Provides access to data portals for the NSF-supported Ridge 2000 and MARGINS programs, the Antarctic and Southern Ocean Data Synthesis, the Global Multi-Resolution Topography Synthesis, and the Seismic Reflection Field Data Portal.

Geology

  • Also see PANGAEA, listed under Multidisciplinary repositories.
  • IRIS (Incorporated Research Institutions for Seismology). From 100+ US universities and the National Science Foundation.

Geosciences and geospatial data

  • Also see DataONE and PANGAEA, listed under Multidisciplinary repositories.
  • GeoNames. A database of placenames, under a CC-BY license. Founded by Marc Wick.
  • The Geosciences Network (GEON) project is a collaboration among a dozen PI institutions and a number of other partner projects, institutions, and agencies to develop cyberinfrastructure in support of an environment for integrative geoscience research. GEON is funded by the NSF Information Technology Research (ITR) program.
  • The National Space Science Data Center serves as the permanent archive for NASA space science mission data. "Space science" means astronomy and astrophysics, solar and space plasma physics, and planetary and lunar science. As permanent archive, NSSDC teams with NASA's discipline-specific space science "active archives" which provide access to data to researchers and, in some cases, to the general public.
  • ShareGeo. Integrating the older GRADE (Geospatial Repository for Academic Deposit and Extraction) repository. From EDINA.

Linguistics

Marine sciences

  • Also see DataONE and PANGAEA, listed under Multidisciplinary repositories.
  • BCO-DMO. The Biological and Chemical Oceanography Data Management Office provides access to data sets contributed by investigators funded by the Biological and Chemical Oceanography sections of the US National Science Foundation (NSF).

Medicine

  • Also see Entrez databases, listed under Multidisciplinary repositories.
  • The Health and Medical Care Archive (HMCA) is the data archive of the Robert Wood Johnson Foundation (RWJF), the largest philanthropy devoted exclusively to health and health care in the United States. Operated by the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan, HMCA preserves and disseminates data collected by selected research projects funded by the Foundation and facilitates secondary analyses of the data. The data collections in HMCA include surveys of health care professionals and organizations, investigations of access to medical care, surveys on substance abuse, and evaluations of innovative programs for the delivery of health care. Our goal is to increase understanding of health and health care in the United States through secondary analysis of RWJF-supported data collections.
  • MIRAGE (Middlesex medical Image Repository with a CBIR ArchivinG Environment). From JISC and Middlesex University.

Multidisciplinary repositories

  • Also see Social sciences.
  • Also see BCO-DMO, Marine Biology data, listed with Marine Sciences repositories.
  • DataONE. An international federation of data repositories containing earth observations data, including data from fields such as ecology, biology, evolution, and environmental sciences such as hydrology, oceanography, and atmospheric science. Hundreds of field stations, universities, and government agencies participate through the DataONE Member Nodes.
  • FigShare. Scientific publishing as it stands is an inefficient way to do science on a global scale. A lot of time and money is being wasted by groups around the world duplicating research that has already been carried out. FigShare allows you to share all of your data, negative results and unpublished figures. In doing this, other researchers will not duplicate the work, but instead may publish with your previously wasted figures, or offer collaboration opportunities and feedback on preprint figures.
  • KPBC. Regional academic repository for data in all fields, Poland.
  • UPSpace. University of Pretoria Research Repository, South Africa.

Physics

  • Also see Astronomy.
  • HEP Data. The data comprise total and differential cross sections, structure functions, fragmentation functions, distributions of jet measures, polarisations, etc., from a wide range of interactions.

Social sciences

  • Also see Multidisciplinary repositories.
  • ICPSR (Inter-University Consortium for Political and Social Research). At the University of Michigan.