{"id":18,"date":"2024-04-04T04:36:11","date_gmt":"2024-04-04T01:36:11","guid":{"rendered":"https:\/\/sisu.ut.ee\/andmehaldus\/open-research-data\/"},"modified":"2024-04-16T17:55:04","modified_gmt":"2024-04-16T14:55:04","slug":"open-research-data","status":"publish","type":"page","link":"https:\/\/sisu.ut.ee\/andmehaldus\/open-research-data\/?lang=en","title":{"rendered":"Open Research Data"},"content":{"rendered":"<p style=\"text-align: justify\">\n\tThe word \u2019data\u2019 (singular \u2019datum\u2019) originates from Latin, literally: (things) given, from <em>dare <\/em>to give. (Collins English Dictionary).\n<\/p>\n<p style=\"text-align: justify\">\n\tEssentially, data is something that describe the objectively existing matters.\n<\/p>\n<blockquote>\n<p style=\"text-align: justify\">\n\t\tData is raw material, which enable the structuring, categorising and measuring of the world. Data become valuable when people use it.\n\t<\/p>\n<\/blockquote>\n<blockquote>\n<p style=\"text-align: justify\">\n\t\tShortly Reserach Data is any information collected and processed to produce original research results.\n\t<\/p>\n<\/blockquote>\n<p style=\"margin-left: 40px;text-align: justify\">\n\t\u00a0\n<\/p>\n<p>\n\tIn\u00a0<a data-url=\"https:\/\/www.ukri.org\/wp-content\/uploads\/2020\/10\/UKRI-020920-ConcordatonOpenResearchData.pdf\" href=\"https:\/\/www.ukri.org\/wp-content\/uploads\/2020\/10\/UKRI-020920-ConcordatonOpenResearchData.pdf\" target=\"_blank\" title=\"\" rel=\"noopener\">Concordat on Open Research Data<\/a>, developed in UK,\u00a0the following definition of research data\u00a0has\u00a0been adopted:\u00a0\n<\/p>\n<p>\n\t<strong>Research data<\/strong> are the evidence that underpins the answer to the research question, and can be used to validate findings regardless of its form (e.g. print, digital, or physical). These might be quantitative information or qualitative statements collected by researchers in the course of their work by experimentation, observation, modelling, interview or other methods, or information derived from existing evidence. Data may be raw or primary (e.g. direct from measurement or collection) or derived from primary data for subsequent analysis or interpretation (e.g. cleaned up or as an extract from a larger data set), or derived from existing sources where the rights may be held by others. Data may be defined as \u2018relational\u2019 or \u2018functional\u2019 components of research, thus signalling that their identification and value lies in whether and how researchers use them as evidence for claims. They may include, for example, statistics, collections of digital images, sound recordings, transcripts of interviews, survey data and fieldwork observations with appropriate annotations, an interpretation, an artwork, archives, found objects, published texts or a manuscript.\n<\/p>\n<p style=\"margin-left: 40px;text-align: justify\">\n\t\u00a0\n<\/p>\n<p style=\"text-align: justify\">\n\t<strong>Open research data<\/strong>\u00a0is the research data made accessible to others via Open Access, for sharing and re-using, provided that re-users would cite the original data. We can say that the main characteristic by which we can define open research data is its <strong>accessibility<\/strong>.\n<\/p>\n<p style=\"text-align: justify\">\n\t<strong>Open data of the public sector<\/strong> (public data, government data) are the data created by the public sector and held in data sets without any limits to their accessibility. They form the basis for public services and decisions. When public data are used in research, they also become part of research data. The publc data can be found in\u00a0<a data-url=\"https:\/\/opendata.riik.ee\/en\/\" href=\"https:\/\/opendata.riik.ee\/en\/\" target=\"_blank\" title=\"\" rel=\"noopener\">Estonian Open Government Data Portal<\/a>.\n<\/p>\n<p style=\"text-align: justify\">\n\tData can be classified based on different characteristics:\n<\/p>\n<ul>\n<li>\n<p style=\"text-align: justify\">\n\t\t\t<strong>Quantitative and qualitative data<\/strong>\n\t\t<\/p>\n<\/li>\n<\/ul>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Quantitative data<\/strong> can be expressed in numbers \u2013 measures or weights, ratio scale (1-100), yes\/no decisions, etc.\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Qualitative data<\/strong> cannot immediately be expressed in numbers, such as text, image, video, etc. Qualitative data can be analysed and converted into numerical data.\n<\/p>\n<ul>\n<li>\n<p style=\"text-align: justify\">\n\t\t\t<strong>Structured and unstructured data<\/strong>\n\t\t<\/p>\n<\/li>\n<\/ul>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Structured data<\/strong> is the data organised in a way where each variable has a corresponding value and these variable-value pairs form a database. Such data can be computer-processed and searched by using algorithms and query languages. The result of data processing can be visualised and used for creating models and simulations.\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\tStructured data are related to the term \u2018<strong>linked data<\/strong>\u2019. Data are structured in a way which allows to link them according to a standard; together they form the semantic web. <a href=\"https:\/\/wiki.dbpedia.org\/\" target=\"_blank\" rel=\"noopener\">DBPedia<\/a> is an example of open linked data.\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Unstructured data<\/strong>\u00a0have no fixed model or structure, it is difficult to process it with computers.\n<\/p>\n<ul>\n<li>\n<p style=\"text-align: justify\">\n\t\t\t<strong>Primary and secondary data<\/strong>\n\t\t<\/p>\n<\/li>\n<\/ul>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Primary data<\/strong> are the data collected by the researchers themselves. Data which have not yet been processed or analysed are called <strong>raw data.<\/strong>\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Secondary data<\/strong> is data previously collected by the researcher himself or by someone else. It is a reuse of data.\n<\/p>\n<ul>\n<li>\n<p style=\"text-align: justify\">\n\t\t\t<strong>Metadata<\/strong>\n\t\t<\/p>\n<\/li>\n<\/ul>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Metadata<\/strong> are the data about data, which should be openly accessible to all even if the original data are not accessible.\n<\/p>\n<ul>\n<li>\n<p style=\"text-align: justify\">\n\t\t\t<strong>Big data and \u201clong tail\u201d data<\/strong>\n\t\t<\/p>\n<\/li>\n<\/ul>\n<p style=\"margin-left: 80px;text-align: justify\">\n\t<strong>Big data<\/strong> are characterised by a large volume of data, continuous and high speed of data processing, and extremely various types of data (3V: Volume, Velocity, Variety). Big data are the structured data which can be automatically processed.\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\tFor example, data from the Internet of Things (IoT), meaning the data from devices that are constantly saving and sharing data, can be called big data.\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\tBig data can also be open data, but its <strong>main characteristic is its large volume<\/strong>.\n<\/p>\n<p style=\"margin-left: 80px;text-align: justify\">\n\tHowever, most of the present-day research is dealing with data of smaller volume, which is called \u201c<strong>long tail data<\/strong>\u201d, because when visualised on a graph, where one axis is the volume of data and the other axis is the number of relevant data sets, the customary research forms this long and thin tail on the number axis. The volume of the data is small, only certain types of data are collected and processed at the same time, the data collecting and processing methods are traditional and they may be tailored for studying some certain problem.\n<\/p>\n<p style=\"text-align: justify\">\n\tCulina, A., M. Baglioni, T.W. Crowther, M.E. Visser, S. Woutersen-Windhouwer, and P.\u00a0Manghi. 2018. Navigating the unfolding open data landscape in ecology and evolution.\u00a0Nature Ecology &amp; Evolution 2: 420-426\n<\/p>\n<p style=\"text-align: justify;margin-left: 40px\">\n\t<em>Long tail of science.<\/em><em>\u00a0Dispersed scientific research that is conducted by many individual researchers\/teams, and is often of a limited spatial and temporal scale. Data produced in the long tail tend to be small in volume, and less standardized within the same field of study. The majority of scientific funding is spent on this type of research<\/em>.\n<\/p>\n<p style=\"text-align: justify;margin-left: 40px\">\n\t\u00a0\n<\/p>\n<p style=\"text-align: center\">\n\t<img decoding=\"async\" alt=\"data\" src=\"https:\/\/sisu.ut.ee\/wp-content\/uploads\/sites\/301\/long_tail.png\" style=\"width: 680px;height: 415px\">\n<\/p>\n<p style=\"text-align: justify\">\n\tFerguson, A. R., Nielson, J. L., Cragin, M. H., Bandrowski, A. E. &amp; Martone, M. E. <a data-url=\"https:\/\/www.nature.com\/articles\/nn.3838\" href=\"https:\/\/www.nature.com\/articles\/nn.3838\" target=\"_blank\" title=\"\" rel=\"noopener\">Big data from small data: data-sharing in the \u2018long tail\u2019 of neuroscience<\/a>. Nat. Neurosci. 17, 1442\u20131447 (2014)<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The word \u2019data\u2019 (singular \u2019datum\u2019) originates from Latin, literally: (things) given, from dare to give. (Collins English Dictionary). Essentially, data is something that describe the objectively existing matters. Data is raw material, which enable the structuring, categorising and measuring of &#8230;<\/p>\n","protected":false},"author":78,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"class_list":["post-18","page","type-page","status-publish","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages\/18","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/users\/78"}],"replies":[{"embeddable":true,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/comments?post=18"}],"version-history":[{"count":1,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages\/18\/revisions"}],"predecessor-version":[{"id":346,"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/pages\/18\/revisions\/346"}],"wp:attachment":[{"href":"https:\/\/sisu.ut.ee\/andmehaldus\/wp-json\/wp\/v2\/media?parent=18"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}