<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Sales Data Mart &#8211; Dimensional Model for Retail</title>
	<atom:link href="http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/feed/" rel="self" type="application/rss+xml" />
	<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/</link>
	<description>Comprehensive Analytics on Open Source Software.</description>
	<lastBuildDate>Thu, 25 Feb 2010 10:06:00 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.3</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Srusti</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-16494</link>
		<dc:creator>Srusti</dc:creator>
		<pubDate>Wed, 03 Sep 2008 06:55:02 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-16494</guid>
		<description>Hi,

I am contributing towards designing a retail datawarehouse. the functional areas include stores, CRM, merchandising, category management etc. I need help in the dimensional modelling phase. Can anyone help please??</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>I am contributing towards designing a retail datawarehouse. the functional areas include stores, CRM, merchandising, category management etc. I need help in the dimensional modelling phase. Can anyone help please??</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Djoni</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-15568</link>
		<dc:creator>Djoni</dc:creator>
		<pubDate>Tue, 12 Aug 2008 19:38:11 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-15568</guid>
		<description>Read this book:

http://www.amazon.com/Dimensional-Data-Warehousing-MySQL-Tutorial/dp/0975212826/ref=pd_bbs_sr_1?ie=UTF8&amp;s=books&amp;qid=1218569860&amp;sr=8-1</description>
		<content:encoded><![CDATA[<p>Read this book:</p>
<p><a href="http://www.amazon.com/Dimensional-Data-Warehousing-MySQL-Tutorial/dp/0975212826/ref=pd_bbs_sr_1?ie=UTF8&amp;s=books&amp;qid=1218569860&amp;sr=8-1" rel="nofollow">http://www.amazon.com/Dimensional-Data-Warehousing-MySQL-Tutorial/dp/0975212826/ref=pd_bbs_sr_1?ie=UTF8&amp;s=books&amp;qid=1218569860&amp;sr=8-1</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Naveen</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-13282</link>
		<dc:creator>Naveen</dc:creator>
		<pubDate>Mon, 26 May 2008 11:25:46 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-13282</guid>
		<description>Really good Nishith !</description>
		<content:encoded><![CDATA[<p>Really good Nishith !</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nirav</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-13280</link>
		<dc:creator>Nirav</dc:creator>
		<pubDate>Mon, 26 May 2008 07:12:49 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-13280</guid>
		<description>hey, guys i have assignment about Dimentions maps and star scema. 

i need a help. I have table from my professor and i am not getting what to do here. pls anyone can help and e-mail me for conatct at nir_prince@hotmail.com will be appriciated. 

pls someone help me.

Nirav</description>
		<content:encoded><![CDATA[<p>hey, guys i have assignment about Dimentions maps and star scema. </p>
<p>i need a help. I have table from my professor and i am not getting what to do here. pls anyone can help and e-mail me for conatct at <a href="mailto:nir_prince@hotmail.com">nir_prince@hotmail.com</a> will be appriciated. </p>
<p>pls someone help me.</p>
<p>Nirav</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nahseez</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-13110</link>
		<dc:creator>Nahseez</dc:creator>
		<pubDate>Tue, 20 May 2008 15:36:20 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-13110</guid>
		<description>WELL SAID...</description>
		<content:encoded><![CDATA[<p>WELL SAID&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JAGDEEP SINGH MORE</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-10541</link>
		<dc:creator>JAGDEEP SINGH MORE</dc:creator>
		<pubDate>Mon, 03 Mar 2008 05:03:33 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-10541</guid>
		<description>Hello
I am working as a database engineer in a reputed MNC.i m working over cognos tool.now the work assigned to me is to develop a sales data mart.please help me our howe to make it and from where to start.
thank u</description>
		<content:encoded><![CDATA[<p>Hello<br />
I am working as a database engineer in a reputed MNC.i m working over cognos tool.now the work assigned to me is to develop a sales data mart.please help me our howe to make it and from where to start.<br />
thank u</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nishith</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-10122</link>
		<dc:creator>Nishith</dc:creator>
		<pubDate>Thu, 21 Feb 2008 02:26:37 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-10122</guid>
		<description>Yes Mike, you are right.  Most likely the concerns about default value are related to the quality of the source data.  

Bobby,
In some cases it helps to point out that the Data Mart contains processed and transformed data - and replacing NULLS with alternate value can be considered a transformation.  

Also, if you are making a Mart/Warehouse, it is extremely important to keep a detailed and updated Data Dictionary.  These default values, etc. can be noted down against the respective columns in the Data Dictionary.  A common IT Management apprehensions is that the knowledge (of default values, say) would  be lost once the concerned resources are no longer around. This apprehension can be addressed to some extent by keeping the Data Dictionary updated, relevant, available and utilized.</description>
		<content:encoded><![CDATA[<p>Yes Mike, you are right.  Most likely the concerns about default value are related to the quality of the source data.  </p>
<p>Bobby,<br />
In some cases it helps to point out that the Data Mart contains processed and transformed data &#8211; and replacing NULLS with alternate value can be considered a transformation.  </p>
<p>Also, if you are making a Mart/Warehouse, it is extremely important to keep a detailed and updated Data Dictionary.  These default values, etc. can be noted down against the respective columns in the Data Dictionary.  A common IT Management apprehensions is that the knowledge (of default values, say) would  be lost once the concerned resources are no longer around. This apprehension can be addressed to some extent by keeping the Data Dictionary updated, relevant, available and utilized.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mike</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-10113</link>
		<dc:creator>mike</dc:creator>
		<pubDate>Wed, 20 Feb 2008 23:26:10 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-10113</guid>
		<description>Bobby,

Nisrith&#039;s comment on setting a NULL value is also what I had described you would likely want to / need to do.  To clarify my statement (and your question), since a row in the fact table has a multi-part key, you MUST have a legitimate value for the Promotion Id key, otherwise you have no choice but to NOT load the row in the fact table to maintain integrity.

Final thought on the &quot;bad practice&quot; of setting default values.  Sounds like your clients are concerned about setting a default value when the quality of the source data is unknown. In other words, the product may or may not have been promoted - no one knows for sure.  In this case, it is ok to set a value that means &quot;unknown&quot; vs. &quot;not promoted&quot;. In this way, you can always create a report listing of rows with &quot;unknown&quot; for follow-up resolution with the source.


Mike</description>
		<content:encoded><![CDATA[<p>Bobby,</p>
<p>Nisrith&#8217;s comment on setting a NULL value is also what I had described you would likely want to / need to do.  To clarify my statement (and your question), since a row in the fact table has a multi-part key, you MUST have a legitimate value for the Promotion Id key, otherwise you have no choice but to NOT load the row in the fact table to maintain integrity.</p>
<p>Final thought on the &#8220;bad practice&#8221; of setting default values.  Sounds like your clients are concerned about setting a default value when the quality of the source data is unknown. In other words, the product may or may not have been promoted &#8211; no one knows for sure.  In this case, it is ok to set a value that means &#8220;unknown&#8221; vs. &#8220;not promoted&#8221;. In this way, you can always create a report listing of rows with &#8220;unknown&#8221; for follow-up resolution with the source.</p>
<p>Mike</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nishith</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-10051</link>
		<dc:creator>Nishith</dc:creator>
		<pubDate>Wed, 20 Feb 2008 01:51:13 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-10051</guid>
		<description>Hi Bobby,

You have correctly pointed out that when you make a report by a particular dimension (say promotion_id), then the report will not show data for records in the fact table having promotion_id column as blank. 

This is a common enough situation, and the solution you have recommended to clients is almost always the right solution (use default values, say NULL, with a corresponding row in the Promotions dimension table).

Outer joins are expensive, and you definitely do not need multiple star-schemas.

I can understand some of the clients being apprehensive, and its upto you to educate them and keep reminding that the data warehouse is unlike any other database they have dealt with in small, yet very critical, ways. 

The &lt;a href=&quot;http://opensourceanalytics.com/2005/11/02/database-vs-data-warehouse/&quot; rel=&quot;nofollow&quot;&gt;differences between OLTP and OLAP systems&lt;/a&gt; need to be understood by IT Management to roll out the best solutions instead of blindly following dogmas that are  only for OLTP solutions. For example, a banking system database would need to be backed up at least daily, whereas a data warehouse with only a monthly batch refresh of data does not need a daily backup.</description>
		<content:encoded><![CDATA[<p>Hi Bobby,</p>
<p>You have correctly pointed out that when you make a report by a particular dimension (say promotion_id), then the report will not show data for records in the fact table having promotion_id column as blank. </p>
<p>This is a common enough situation, and the solution you have recommended to clients is almost always the right solution (use default values, say NULL, with a corresponding row in the Promotions dimension table).</p>
<p>Outer joins are expensive, and you definitely do not need multiple star-schemas.</p>
<p>I can understand some of the clients being apprehensive, and its upto you to educate them and keep reminding that the data warehouse is unlike any other database they have dealt with in small, yet very critical, ways. </p>
<p>The <a href="http://opensourceanalytics.com/2005/11/02/database-vs-data-warehouse/" rel="nofollow">differences between OLTP and OLAP systems</a> need to be understood by IT Management to roll out the best solutions instead of blindly following dogmas that are  only for OLTP solutions. For example, a banking system database would need to be backed up at least daily, whereas a data warehouse with only a monthly batch refresh of data does not need a daily backup.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bobby</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/comment-page-1/#comment-9939</link>
		<dc:creator>bobby</dc:creator>
		<pubDate>Sun, 17 Feb 2008 14:26:17 +0000</pubDate>
		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48#comment-9939</guid>
		<description>Mike,

Thank you for the answer...

however I am not understanding why you say that &quot;if a store is not running a Promotion on a given product during a given day/week you won&#039;t have a row (transaction) in the fact table&quot;

In my understanding if the store sold a product with no promotion then there would in fact be a row (transaction) in the fact table - it would have valid values for other measures including store_id, customer_id, product_id, etc.

Again my understanding is that one could measure sales across any of these dimensions.  So I might have a report for sales by store_id or sales by product_id. 

I would want to see my row from the store that does not have a promotion id in these reports. 

However I am still left with two choices when I go to add the dimension of promotion id;

1. Left outer join between fact and promotion id table

2. not picking up the row (transaction) or rows that don&#039;t have a promotion id

both seem bad options so my recommendation to clients is often to use a default value.

however many clients have refused this recommendation as bad practice (i.e. inserting a value when no value was present on the source data).

I am just wondering if there is one &quot;best practice&quot; way to do this when assembling star schemas.

Thanks again

bobby</description>
		<content:encoded><![CDATA[<p>Mike,</p>
<p>Thank you for the answer&#8230;</p>
<p>however I am not understanding why you say that &#8220;if a store is not running a Promotion on a given product during a given day/week you won&#8217;t have a row (transaction) in the fact table&#8221;</p>
<p>In my understanding if the store sold a product with no promotion then there would in fact be a row (transaction) in the fact table &#8211; it would have valid values for other measures including store_id, customer_id, product_id, etc.</p>
<p>Again my understanding is that one could measure sales across any of these dimensions.  So I might have a report for sales by store_id or sales by product_id. </p>
<p>I would want to see my row from the store that does not have a promotion id in these reports. </p>
<p>However I am still left with two choices when I go to add the dimension of promotion id;</p>
<p>1. Left outer join between fact and promotion id table</p>
<p>2. not picking up the row (transaction) or rows that don&#8217;t have a promotion id</p>
<p>both seem bad options so my recommendation to clients is often to use a default value.</p>
<p>however many clients have refused this recommendation as bad practice (i.e. inserting a value when no value was present on the source data).</p>
<p>I am just wondering if there is one &#8220;best practice&#8221; way to do this when assembling star schemas.</p>
<p>Thanks again</p>
<p>bobby</p>
]]></content:encoded>
	</item>
</channel>
</rss>
