{"id":7989,"date":"2020-09-12T17:55:59","date_gmt":"2020-09-12T17:55:59","guid":{"rendered":"http:\/\/onthe8spot.com\/?p=7989"},"modified":"2020-09-12T17:55:59","modified_gmt":"2020-09-12T17:55:59","slug":"data-quality-implementation-in-data-warehouses-toptal","status":"publish","type":"post","link":"http:\/\/onthe8spot.com\/index.php\/2020\/09\/12\/data-quality-implementation-in-data-warehouses-toptal\/","title":{"rendered":"Data Quality Implementation in Data Warehouses &#124; Toptal"},"content":{"rendered":"<blockquote>\n<h3 id=\"data-quality-dimensions\">Data Quality Dimensions<\/h3>\n<p><strong>DQ dimensions<\/strong>\u00a0are a common way to identify and cluster DQ checks. There are many definitions, and the number of dimensions varies considerably: You might find 16, or even more dimensions. From a practical perspective, it is less confusing to start with a few dimensions and find a general understanding of them among your users.<\/p>\n<ul>\n<li><strong>Completeness:<\/strong>\u00a0Is all the data required available and accessible? Are all sources needed available and loaded? Was data lost between stages?<\/li>\n<li><strong>Consistency:<\/strong>\u00a0Is there erroneous\/conflicting\/inconsistent data? For example, the termination date of a contract in a \u201cTerminated\u201d state must contain a valid date higher than or equal to the start date of the contract.<\/li>\n<li><strong>Uniqueness:<\/strong>\u00a0Are there any duplicates?<\/li>\n<li><strong>Integrity:<\/strong>\u00a0Is all data linked correctly? For example, are there orders linking to nonexistent customer IDs (a classic referential integrity problem)?<\/li>\n<li><strong>Timeliness:<\/strong>\u00a0Is the data current? For example, in a data warehouse with daily updates, I would expect yesterday\u2019s data available today.<\/li>\n<\/ul>\n<\/blockquote>\n<p>Source: <em><a href=\"https:\/\/www.toptal.com\/database\/data-warehouse-data-quality-process?utm_campaign=Toptal%20Engineering%20Blog&amp;utm_medium=email&amp;_hsmi=94506066&amp;_hsenc=p2ANqtz--MAfFztBBuP7RnOugj16RqZhYg5z1-ic94za04vvscwBaXjMmEtBJv-45-7XR_gvt5bB7E__7XDs_XSwa5ife_H5YIZhgrZuqiegpOTF70zJmiH1k&amp;utm_content=94506066&amp;utm_source=hs_email\">Data Quality Implementation in Data Warehouses | Toptal<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Data Quality Dimensions DQ dimensions\u00a0are a common way to identify and cluster DQ checks. There are many definitions, and the number of dimensions varies considerably: You might find 16, or even more dimensions. From a practical perspective, it is less confusing to start with a few dimensions and find a general understanding of them among &hellip; <\/p>\n<p class=\"link-more\"><a href=\"http:\/\/onthe8spot.com\/index.php\/2020\/09\/12\/data-quality-implementation-in-data-warehouses-toptal\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Data Quality Implementation in Data Warehouses &#124; Toptal&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[62],"tags":[],"class_list":["post-7989","post","type-post","status-publish","format-standard","hentry","category-personal-angol"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/posts\/7989","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/comments?post=7989"}],"version-history":[{"count":0,"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/posts\/7989\/revisions"}],"wp:attachment":[{"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/media?parent=7989"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/categories?post=7989"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/onthe8spot.com\/index.php\/wp-json\/wp\/v2\/tags?post=7989"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}