{"id":3,"date":"2024-04-04T04:36:11","date_gmt":"2024-04-04T01:36:11","guid":{"rendered":"https:\/\/sisu.ut.ee\/andmehaldus\/data-repositories\/"},"modified":"2024-04-16T16:58:36","modified_gmt":"2024-04-16T13:58:36","slug":"data-repositories","status":"publish","type":"page","link":"https:\/\/sisu.ut.ee\/andmehaldus\/data-repositories\/?lang=en","title":{"rendered":"Data repositories"},"content":{"rendered":"<p style=\"text-align: justify\">\n\tResearch data repositories are data storage facilities which store and curate data sets, ensuring their long-time preservation and access to the data.\n<\/p>\n<blockquote>\n<p style=\"text-align: justify\">\n\t\tWhen planning their data management, it is important for researchers to decide upon the repository where they will store their data.\n\t<\/p>\n<\/blockquote>\n<p style=\"text-align: justify\">\n\t<strong>Repositories are registred in\u00a0<a href=\"https:\/\/www.re3data.org\/\" target=\"_blank\" rel=\"noopener\">re3data<\/a><\/strong>.\u00a0\n<\/p>\n<p style=\"text-align: center\">\n\t<img decoding=\"async\" alt=\"re3data\" src=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/re3data.png\" style=\"width: 243px;height: 119px\">\n<\/p>\n<p style=\"text-align: justify\">\n\tThere are four main types of repositories:\n<\/p>\n<p style=\"text-align: justify\">\n\t<strong>Universal repositories <\/strong>\n<\/p>\n<p style=\"text-align: justify\">\n\tRepositories of this type accept data in all formats. However, in case of very specific formats it is more advisable to use subject repositories.\n<\/p>\n<ul>\n<li style=\"text-align: justify\">\n\t\t<a href=\"https:\/\/zenodo.org\/\">Zenodo<\/a>\u00a0is a research data repository\u00a0 that is financed by the EU foundations and all researchers may upload their data (up to 50GB).\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a href=\"https:\/\/figshare.com\/\">Figshare<\/a>\u00a0is also a universal repository, allowing to upload data sets, articles, videos, posters, code, etc.\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a href=\"https:\/\/osf.io\/\">Open Science Framework<\/a>\u00a0\u2013\u00a0is a repository and data management platform able to interoperate with Dropbox, GitHub and Zotero\n\t<\/li>\n<\/ul>\n<p style=\"text-align: justify\">\n\t<strong><span style=\"background-color:#ffa07a\">Additional reading:\u00a0<\/span><\/strong><span style=\"background-color:#ffa07a\">\u00a0<\/span><a data-url=\"https:\/\/fairsharing.org\/collection\/GeneralRepositoryComparison\" href=\"https:\/\/fairsharing.org\/collection\/GeneralRepositoryComparison\" target=\"_blank\" title=\"\" rel=\"noopener\">General Repository Comparison<\/a>\u00a0is a tool for researchers to help select the best general repository for their data or other digital objects.\n<\/p>\n<p style=\"text-align: justify\">\n\t<strong>Disciplinary repositories\u00a0<\/strong>\n<\/p>\n<p style=\"text-align: justify\">\n\tRepositories of this type are essential to researchers because they support more specific formats and subject metadata standards. The number of such repositories is relatively large, here are a few examples of them:\n<\/p>\n<ul>\n<li style=\"text-align: justify\">\n\t\t<a data-url=\"https:\/\/www.dataone.org\/ \" href=\"https:\/\/www.dataone.org\/%20\" target=\"_blank\" title=\"\" rel=\"noopener\">DataOne<\/a>, <a href=\"https:\/\/www.pangaea.de\/\">PANGAEA<\/a> \u2013 environmental data and ecology\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a href=\"http:\/\/opentrials.net\/\">OpenTrials<\/a> \u2013 medicine\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a data-url=\"hthttps:\/\/www.gesis.org\/en\/home\" href=\"\/\/www.gesis.org\/en\/home\" target=\"_blank\" title=\"\" rel=\"noopener\">GESIS<\/a>, <a href=\"https:\/\/www.icpsr.umich.edu\/icpsrweb\/ICPSR\/\">ICPSR<\/a> \u2013 data archive of social sciences\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a href=\"https:\/\/www.tdar.org\/\">tDAR<\/a> \u2013 archaeology\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a href=\"http:\/\/www.crystallography.net\/cod\/index.php\">COD<\/a>, <a href=\"https:\/\/qsardb.org\/\">QsarDB<\/a> \u2013 chemistry\n\t<\/li>\n<\/ul>\n<p style=\"text-align: justify\">\n\tSometimes research funders publish a list of accepted repositories, let us take a look at <a data-url=\"https:\/\/www.nlm.nih.gov\/NIHbmic\/domain_specific_repositories.html\" href=\"https:\/\/www.nlm.nih.gov\/NIHbmic\/domain_specific_repositories.html\" target=\"_blank\" title=\"\" rel=\"noopener\">biomedical repositories<\/a> as an example. The list includes information how to submit and access the data.\u00a0\n<\/p>\n<p style=\"text-align: justify\">\n\t<strong>National repositories<\/strong>\n<\/p>\n<ul>\n<li style=\"text-align: justify\">\n\t\t<a href=\"https:\/\/www.ukdataservice.ac.uk\/\">UK <\/a><a href=\"https:\/\/www.ukdataservice.ac.uk\/\">Data Service<\/a>\u00a0 \u2013 in the United Kingdom collects and preserves public open data (population census, health records, data from long-term studies, social and economic data)\n\t<\/li>\n<li style=\"text-align: justify\">\n\t\t<a href=\"https:\/\/www.ands.org.au\/\">ANDS<\/a> \u2013 Australian national repository of research data\n\t<\/li>\n<\/ul>\n<p style=\"text-align: justify\">\n\tI<strong>nstitutional repositories<\/strong>\n<\/p>\n<p style=\"text-align: justify\">\n\tThese repositories are provided by universities.\n<\/p>\n<p style=\"text-align: justify\">\n\tA recent survey shows that researchers generally like to use their university repository, but are just as likely to use universal repositories (international + general-purpose on this graph):\n<\/p>\n<p style=\"text-align: center\">\n\t<img loading=\"lazy\" decoding=\"async\" width=\"753\" height=\"471\" class=\"alignnone wp-image-71\" src=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/repos.png\" title=\"repos.png\" alt=\"Repos\" srcset=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/repos.png 753w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/repos-300x188.png 300w\" sizes=\"auto, (max-width: 753px) 100vw, 753px\">\n<\/p>\n<p>\n\tResource:\u00a0European Commission, Directorate-General for Research and Innovation,\u00a0European Research Data Landscape \u2013 Final report, Publications Office of the European Union, 2022,\u00a0<a href=\"https:\/\/data.europa.eu\/doi\/10.2777\/3648\" target=\"_blank\" rel=\"noopener\">https:\/\/data.europa.eu\/doi\/10.2777\/3648<\/a>\n<\/p>\n<blockquote>\n<p style=\"text-align: justify\">\n\t\tUniversity of Tartu repository <a href=\"http:\/\/datadoi.ut.ee\/?locale-attribute=en\">DataDOI<\/a> is a DSpace-based platform for permanent storage of research data. If a researcher cannot find a subject-based environment, which could ensure the preservation their data, it is possible to upload it on DataDOI; the data will also be assigned a DOI and metadata will be registered via DataCite.\u00a0 \u00a0\u00a0\n\t<\/p>\n<\/blockquote>\n<p style=\"text-align: justify\">\n\tThe preservation and accessibility of data is ensured by the University of Tartu Library, which is the administrator of the repository. The administrator also guarantees updating of the repository software, its compliance with standards, the functioning of data exchange, etc. The target groups of this repository are single researchers and subunits\/work groups at the UT and outside, handling the so-called long tail data, whose volume is not very large. The UT researchers can use DataDOI free of charge.\n<\/p>\n<p style=\"text-align: justify\">\n\t<img loading=\"lazy\" decoding=\"async\" width=\"1075\" height=\"431\" class=\"alignnone wp-image-70\" src=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/datadoi_eng.png\" title=\"datadoi_eng.png\" alt=\"DataDOI_eng\" srcset=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/datadoi_eng.png 1075w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/datadoi_eng-300x120.png 300w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/datadoi_eng-1024x411.png 1024w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/datadoi_eng-768x308.png 768w\" sizes=\"auto, (max-width: 1075px) 100vw, 1075px\">\n<\/p>\n<p style=\"text-align: justify\">\n\tOther data repositories, which are members of the <a data-url=\"https:\/\/datacite.ee\/en\/consortium\/\" href=\"https:\/\/datacite.ee\/en\/consortium\/\" target=\"_blank\" title=\"\" rel=\"noopener\">DataCite Estonia Consortium<\/a> are listed on the <a data-url=\"https:\/\/datacite.ee\/en\/repositories\/\" href=\"https:\/\/datacite.ee\/en\/repositories\/\" target=\"_blank\" title=\"\" rel=\"noopener\">DataCite Estonia web page<\/a>. All these repositories can use the same services as DataDOI.\n<\/p>\n<blockquote>\n<p style=\"text-align: justify\">\n\t\tIf you do not use the above-shown data centres, you should indicate the trustworthiness of the repository you have chosen.\n\t<\/p>\n<\/blockquote>\n<p style=\"text-align: justify\">\n\tQuality repositories can be searched with the help of <a href=\"https:\/\/repositoryfinder.datacite.org\/\">Repository Finder<\/a> where you will get a list of repositories preserving the FAIR data.<br>Several journals also recommend data repositories that meet the quality requirements of the journal, such as <a data-url=\"https:\/\/www.nature.com\/sdata\/policies\/repositories\" href=\"https:\/\/www.nature.com\/sdata\/policies\/repositories\" target=\"_blank\" title=\"\" rel=\"noopener\">Nature Scientific Data Recommended Data Repositories<\/a>.\n<\/p>\n<p style=\"text-align: justify\">\n\tAnother quality indicator of a repository is the <a href=\"https:\/\/www.coretrustseal.org\/why-certification\/certified-repositories\/\">Core Trust Seal certificate<\/a>. The certificate takes into account the organisational and technical structure of the repository, the level of data curation, etc.\n<\/p>\n<h3 style=\"text-align:justify\">\n\tData Preparation<br>\n<\/h3>\n<p style=\"text-align:justify\">\n\tIn order to decide which data should be stored in a repository, you should carefully consider the objective and the period of preservation of your data.\n<\/p>\n<blockquote>\n<p style=\"text-align:justify\">\n\t\tAmong the objectives of storing the data can be the requirements of the grant giver of the journal where you plan to publish, the need for validation, using of the data in teaching, etc.\n\t<\/p>\n<\/blockquote>\n<p style=\"text-align:justify\">\n\tIn general, repositories promise to store and curate the data for ten years. During this time, they ensure the accessibility, interoperability and reusability of data. Naturally, data can be stored for a much longer period, but in such case, not all functions may work anymore.\n<\/p>\n<p style=\"text-align:justify\">\n\tSpecial attention should be paid to the protection of personal data to make sure that the data were anonymised.\n<\/p>\n<p style=\"text-align:justify\">\n\tData can be deleted, and sometimes it even has to be deleted, but there should be an explanation about the reasons for such actions.\n<\/p>\n<p style=\"text-align:justify\">\n\t<strong><span style=\"background-color:#ff8c00\">Additional reading<\/span><\/strong> for those who are about to preserve their data in a repository:\u00a0\n<\/p>\n<p style=\"text-align:justify\">\n\tLahtinen, T., Mela, M., M\u00e4kel\u00e4, M., Nurmi, N., &amp; Kuusniemi, M. E. (2023).\u00a0<strong>How to become a data preserver<\/strong>: The official University of Helsinki guide to the responsible preservation of research data (2.0). Zenodo.\u00a0<a data-auth=\"NotApplicable\" data-linkindex=\"0\" href=\"https:\/\/doi.org\/10.5281\/zenodo.10424017\" rel=\"noopener noreferrer\" target=\"_blank\">https:\/\/doi.org\/10.5281\/zenodo.10424017<\/a>\n<\/p>\n<h3 style=\"text-align: justify\">\n\tServices of Data Centres<br>\n<\/h3>\n<p style=\"text-align: justify\">\n\tLarge repositories often offer <strong>software for processing more common formats<\/strong>. New formats may pose a problem and make data curation more complicated.\n<\/p>\n<p style=\"text-align: justify\">\n\tIn some cases, the use of data format can be negotiated between the researcher and the repository so that the main objective \u2013 data preservation and access to it \u2013 were achieved and the researcher\u2019s needs were met.\u00a0<br>As a rule, data processing will still take place after downloading the data.\n<\/p>\n<p style=\"text-align: justify\">\n\t\u00a0\n<\/p>\n<p style=\"text-align: justify\">\n\tRepositories can also offer <strong>bibliometric data and statistics<\/strong> concerning the downloading and visualisation of a data set.\n<\/p>\n<p style=\"text-align: justify\">\n\tRepositories can help researchers in selecting the most appropriate <strong>licences<\/strong>.\n<\/p>\n<p class=\"Default\" style=\"text-align: justify\">\n\tIn general, repositories use a certain type of deposition licence, specifying the rights that the owner of the data transfers to the repository; this will ensure the quality of data curation. Uploading of data in a repository does not change the ownership of the data.\n<\/p>\n<p class=\"Default\" style=\"text-align: justify\">\n\tLicencing is one of the most important services offered by repositories. This helps the researcher to save much time as both the depositor and the user of the data (compared to asking the consent of every single author when using some other method for data sharing and obtaining).\n<\/p>\n<p class=\"Default\" style=\"text-align: justify\">\n\tThe owner of the data should create an account and register in order to use the repository, and choose the most suitable type of licence.\n<\/p>\n<p class=\"Default\" style=\"text-align: justify\">\n\tIt is also necessary to make sure that the data set and its related documents were similarly licenced.\u00a0\n<\/p>\n<blockquote>\n<p class=\"Default\" style=\"text-align: justify\">\n\t\tWeb pages of repositories include conditions for use and instructions for data uploaders, which the researchers should get familiar with even before starting to collect their data.\n\t<\/p>\n<\/blockquote>\n<p class=\"Default\" style=\"text-align: justify\">\n\t\u00a0The last phase is the preparation of data for long-time preservation.\u00a0\n<\/p>\n<p class=\"Default\" style=\"text-align: justify\">\n\tMargaret Levenstein, director, ICPSR (Inter-university Consortium for Political and Social Research), spoke about the importance of machine-readable DMPs and PIDs for enhancing research practices of graduate students and faculty as well as the usefulness for planning repository services.\n<\/p>\n<p class=\"Default\" style=\"text-align: justify\">\n\t<\/p><div class=\"ratio ratio-16x9 mb-3\"><div class=\"video-placeholder-wrapper video-placeholder-wrapper--16x9\">\n\t\t\t    <div class=\"video-placeholder d-flex justify-content-center align-items-center\">\n\t\t\t        <div class=\"overlay text-white p-2 w-100 text-center d-block justify-content-center align-items-center\">\n\t\t\t            <div>Kolmandate osapoolte sisu n\u00e4gemiseks palun n\u00f5ustu k\u00fcpsistega.<\/div>\n\t\t\t            <button class=\"btn btn-secondary btn-sm mt-1 consent-change\">Muuda n\u00f5usolekut<\/button>\n\t\t\t        <\/div>\n\t\t\t    <\/div>\n\t\t\t<\/div>\n<\/div>\n\n<h3 style=\"text-align: justify\">\n\tOpenAIRE<br>\n<\/h3>\n<p style=\"text-align: justify\">\n\t<a href=\"https:\/\/www.openaire.eu\/\" target=\"_blank\" rel=\"noopener\">OpenAIRE<\/a>\u00a0is a European Union project, supporting Open Access and Open Data and harmonising the Open Science policies in Europe by building the e-infrastructure and the European Science Cloud (EOSC).\n<\/p>\n<p style=\"text-align: justify\">\n\tAll projects, publications and data financed by the European Commission, and the financers, have to be visible and searchable on the OpenAIRE portal, interlinked with each other. For storing the results, OpenAIRE offers the services of the repository\u00a0\u00a0<a href=\"https:\/\/zenodo.org\/\" target=\"_blank\" rel=\"noopener\">Zenodo<\/a>, but the data can be stored in any repository which is able to assign DOIs.\n<\/p>\n<p style=\"text-align: center\">\n\t<img loading=\"lazy\" decoding=\"async\" width=\"1556\" height=\"1344\" class=\"alignnone wp-image-69\" style=\"width: 800px;height: 691px\" src=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/openaire_naide_inglise.png\" title=\"openaire_naide_inglise.png\" alt=\"datadoi_inglise\" srcset=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/openaire_naide_inglise.png 1556w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/openaire_naide_inglise-300x259.png 300w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/openaire_naide_inglise-1024x884.png 1024w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/openaire_naide_inglise-768x663.png 768w, https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/openaire_naide_inglise-1536x1327.png 1536w\" sizes=\"auto, (max-width: 1556px) 100vw, 1556px\">\n<\/p>\n<h3 style=\"text-align:justify\">\n\tDigital Object Identifier DOI<br>\n<\/h3>\n<p style=\"text-align:justify\">\n\tDOI (<em>Digital Object Identifier<\/em>) is a series of numbers, characters and symbols that persistently and unchangeably identifies an article, document, dataset, e-book etc. and refers to it on the web. For example, dataset <span style=\"background:white\">\u201cE-raamatute eelt\u00f6\u00f6deldud ja lemmatiseeritud failid\u201d<\/span> has a DOI <a href=\"http:\/\/dx.doi.org\/10.15155\/re-46\">http:\/\/dx.doi.org\/10.15155\/re-46<\/a> which refers to a web address <a data-url=\" http:\/\/datadoi.ee\/handle\/33\/76\" href=\"http:\/\/datadoi.ee\/handle\/33\/76\" target=\"_blank\" title=\"\" rel=\"noopener\">http:\/\/datadoi.ee\/handle\/33\/76<\/a>. If, for some reason, this web address changes (for example, data repository is moved to a new domain) then DOI stays unchangeable and guarantees that dataset is always findable and accessible from the right web address.\n<\/p>\n<p style=\"text-align:justify\">\n\tEstonian data repositories can assign DOIs to data sets via the DatCite Estonia Consortium. Have a look at the data centres which have the right to assign DOIs and if possible, use these centres:\u00a0<a data-url=\"https:\/\/datacite.ee\/en\/repositories\/\" href=\"https:\/\/datacite.ee\/en\/repositories\/\" target=\"_blank\" title=\"\" rel=\"noopener\">DataCite Estonia data repositories<\/a>.\u00a0\n<\/p>\n<p style=\"text-align:center\">\n\t<img decoding=\"async\" alt=\"datacite\" src=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/datacite.png\" style=\"width:197px;height:166px\">\n<\/p>\n<p style=\"text-align:justify\">\n\t<strong>How to get DOI for your research data?<\/strong>\n<\/p>\n<p>\n\tIn Estonia, research data is registered by the DataCite Estonia member universities. The DataCite services are free of charge for the researchers of these universities.\n<\/p>\n<p>\n\tResearchers of other institutions can get DOIs according to the price list.\n<\/p>\n<p>\n\t<strong>Researchers <\/strong>should take the following steps:\n<\/p>\n<ol>\n<li>\n\t\tOrganise their data\n\t<\/li>\n<li>\n\t\tProvide metadata for their dataset\n\t<\/li>\n<li>\n\t\tFind a suitable data centre (subject-based or institutional\n\t<\/li>\n<li>\n\t\tUpload their data following the instructions\n\t<\/li>\n<li>\n\t\tIn case of questions or problems contact the manager of the data centre\n\t<\/li>\n<\/ol>\n<p>\n\tAdditional information and special training are offered by specialists at member university libraries of by the UT Library as a DataCite member.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Research data repositories are data storage facilities which store and curate data sets, ensuring their long-time preservation and access to the data. When planning their data management, it is important for researchers to decide upon the repository where they will &#8230;<\/p>\n","protected":false},"author":78,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"class_list":["post-3","page","type-page","status-publish","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages\/3","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/users\/78"}],"replies":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/comments?post=3"}],"version-history":[{"count":1,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages\/3\/revisions"}],"predecessor-version":[{"id":361,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages\/3\/revisions\/361"}],"wp:attachment":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/media?parent=3"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}