<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Open Source Analytics &#187; DecisionStudio-Professional</title>
	<atom:link href="http://opensourceanalytics.com/category/decisionstudio-professional/feed/" rel="self" type="application/rss+xml" />
	<link>http://opensourceanalytics.com</link>
	<description>Comprehensive Analytics on Open Source Software.</description>
	<lastBuildDate>Tue, 25 Sep 2007 15:12:42 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.3</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Sales Data Mart &#8211; Dimensional Model for Retail</title>
		<link>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/</link>
		<comments>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/#comments</comments>
		<pubDate>Fri, 28 Apr 2006 06:12:08 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[Data Warehousing]]></category>
		<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Open Source Analytics]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/?p=48</guid>
		<description><![CDATA[If you have followed some of the earlier posts, you would remember that a data mart is created as a star schema through a process known as dimensional modeling.  In this post we will create a dimensional model for Sales data mart at a hypothetical retailer.  
Note:  To go through the examples [...]]]></description>
			<content:encoded><![CDATA[<p>If you have followed some of the earlier posts, you would remember that a data mart is created as a star schema through a process known as dimensional modeling.  In this post we will create a dimensional model for Sales data mart at a hypothetical retailer.  <span id="more-48"></span></p>
<p><strong>Note:</strong>  To go through the examples here, you would need MySQL database and DB Designer for database modeling.  You can either install <a href="http://decisionstudio.com/product" target="_blank"><strong>DecisionStudio Professional</strong></a> (<a href="https://sourceforge.net/projects/ds-professional" target="_blank">download from sourceforge</a>), which has both MySQL and DB Designer along with many other analytics goodies, or else you can install them individually from <a href="http://mysql.com" target="_blank">MySQL website</a> and <a href="http://fabforce.net/downloads.php" target="_blank">DB Designer website</a>.  You would also need to <a href="https://sourceforge.net/projects/ds-professional" target="_blank">download the sample foodmart database</a> available along with DecisionStudio Professional.</p>
<p>Now let&#8217;s assume you are an IT person at FoodMart (a hypothetical retailer) who has decided to build a sales data mart as the first step in rolling out comprehensive analytics.  In discussions with the sales department you have figured that the <strong>no. of units sold, dollar amount of sales, and the number of unique customers </strong>in a segment are the main metrics they look at.  Digging deeper you figure that the sales guys are likely to want <strong>analysis by product, product category/class, brand, store location (city, state, region, country, &#8230;), customer demographics, and also by individual promotions and promotion categories</strong>.  It may not be explicitly mentioned, but the metrics would also be analyzed by time (day, week, month, quarter, &#8230;)</p>
<p>Now that you have figured out the business metrics to be measured, this gives you the facts you would need in the data mart <strong>&#8216;fact table&#8217; </strong>for calculating them.  Similarly you have figured out the potential segments for analysis, and that gives you the <strong>&#8216;dimensions&#8217; </strong>for analysis.  The &#8216;fact table&#8217; linked to the &#8216;dimension tables&#8217; makes up the <strong>&#8217;star-schema&#8217;</strong> (because of the star-like structure), also known as the data mart.</p>
<p>With this information in place, we have the high level <strong>Dimensional Model for Sales</strong>.<br />
<a href="http://opensourceanalytics.com/wordpress/wp-content/FoodMartDimensionalModelSales.PNG" target="_blank"><img src='http://opensourceanalytics.com/wordpress/wp-content/thumb-FoodMartDimensionalModelSales.PNG' alt='Dimensional Model for Sales Cube at FoodMart' /></a></p>
<p>Sales_Fact_1998 is the main fact table that has sales information by store/location, product, time, customer, and promotion.  Correspondingly there are 5 dimension tables joined to the fact table through foreign keys in the star-schema.</p>
<p>The dimension tables in turn have detailed data that can now be used for <strong>defining ad-hoc analysis segments</strong>.  For example, we can put demographic filters on the customer dimension (say age&lt;30, married, college-educated), choose specific product class(es) in the product table (say Dairy Products), specify a limited time period, and then get our metrics calculated for the ad-hoc segment.</p>
<p>The image below shows the detailed information available in the dimension tables for defining ad-hoc segments.<br />
<a href="http://opensourceanalytics.com/wordpress/wp-content/FoodMartSalesCubeData.PNG" target="_blank"><img src='http://opensourceanalytics.com/wordpress/wp-content/thumb-FoodMartSalesCubeData.PNG' alt='Dimensional Data for analysis in Sales Cube' /></a></p>
<p>You can <a href="http://opensourceanalytics.com/wordpress/wp-content/FoodMartDatabaseModel.xml" title='Sales Star Schema, and FoodMart database schema' target="_blank">download the data model here</a>, and then open the saved model using DB Designer in DecisionStudio Professional (Start -> Program Files -> DecisionStudio Professional -> Data Analyst -> DB Designer Workbench).  You can see other tables in the FoodMart database by scrolling around on the canvas (scroller in top-right corner).  </p>
<p>Do note that our dimensional model for sales covers only a small relevant set of tables from the entire FoodMart database.  You can load the entire downloaded FoodMart data into MySQL <a href="http://decisionstudio.com/wiki/doku.php?id=restoring_foodmart_data" target="_blank">as outlined here</a>, and can query on the data using Query Browser (Start -> Program Files -> DecisionStudio Professional -> Data Analyst -> MySQL Query Browser).  </p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/04/28/sales-data-mart-dimensional-model-for-retail/feed/</wfw:commentRss>
		<slash:comments>33</slash:comments>
		</item>
		<item>
		<title>Data Mining &#8211; Desktop Survival Guide</title>
		<link>http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/</link>
		<comments>http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/#comments</comments>
		<pubDate>Fri, 10 Mar 2006 08:19:38 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Modeling]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/</guid>
		<description><![CDATA[A good resource on Data Mining using R environment is &#8220;Data Mining &#8211; Desktop Survival Guide&#8221; by Graham Williams.  Its a well researched and frequently updated free online book.  Graham does a good job of going through the data mining process and provides detailed descriptions of the commonly used algorithms.  
Exmaples are [...]]]></description>
			<content:encoded><![CDATA[<p>A good resource on Data Mining using R environment is <a href="http://togaware.com/datamining/survivor/Contents.html">&#8220;Data Mining &#8211; Desktop Survival Guide&#8221; by Graham Williams</a>.  Its a well researched and frequently updated free online book.  Graham does a good job of going through the data mining process and provides detailed descriptions of the commonly used algorithms.  </p>
<p>Exmaples are in R (all with R source code, of course), with some good examples of graphical visualization as well.</p>
<p><a href="http://togaware.com/datamining/survivor/Contents.html">Click here for Data Mining &#8211; Desktop Survival Guide</a></p>
<p>R Environment is a part of <a href="http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/">DecisionStudio-Professional</a> and can also be separately downloaded from <a href="http://r-project.org">http://r-project.org</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Econometrics on R Environment</title>
		<link>http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/</link>
		<comments>http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/#comments</comments>
		<pubDate>Mon, 06 Mar 2006 20:56:15 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[Open Source Analytics]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/</guid>
		<description><![CDATA[I would, in due course, spend some time on the R Environment (available as part of DecisionStudio Professional, and separately downloadable from http://r-project.org).  R provides an excellent alternative to commercial products for modeling, statistical analysis, and graphics.  Originally designed by AT&#038;T Bell Labs, the R environment is fast becoming the standard for cutting [...]]]></description>
			<content:encoded><![CDATA[<p>I would, in due course, spend some time on the R Environment (available as part of <a href="http://decisionstudio.com/product">DecisionStudio Professional</a>, and separately downloadable from <a href="http://r-project.org">http://r-project.org</a>).  R provides an excellent alternative to commercial products for modeling, statistical analysis, and graphics.  Originally designed by AT&#038;T Bell Labs, the R environment is fast becoming the standard for cutting edge number crunching.<br />
<span id="more-32"></span><br />
Bio-informatics is one area of dominance, and a lot of academic papers these days come with R based/compatible implementations.  (So you don&#8217;t have to wait for your commercial BI vendor to first discover that new technique, make up its mind about it, and then finally get a bunch of disinterested code-monkeys to write a new library for, luckily just in time for the release after next).</p>
<p>While R is used a lot for statistical research, its adoption for econometrics has been comparatively slower, possibly due to terminology differences.</p>
<p>An excellent text called <a href="http://cran.r-project.org/doc/contrib/Farnsworth-EconometricsInR.pdf">&#8220;Econometrics in R&#8221; by Grant V. Farnsworth (PDF)</a> provides a effective hands-on intro to the most common things you would find yourself doing including Time Series, regressions, Plotting, etc.  Have a look.</p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>DecisionStudio Professional &#8211; Desktop BI Platform</title>
		<link>http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/</link>
		<comments>http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/#comments</comments>
		<pubDate>Tue, 28 Feb 2006 17:39:34 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[BI, Data Mining, Analytics]]></category>
		<category><![CDATA[Data Warehousing]]></category>
		<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[On Your Own]]></category>
		<category><![CDATA[Open Source Analytics]]></category>
		<category><![CDATA[Reporting]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/</guid>
		<description><![CDATA[On Friday we released DecisionStudio Professional &#8211; a comprehensive and free desktop BI Platform that gives you all the tools needed for analytics under a single package licensed under GNU Public License (GPL).
DecisionStudio Professional (DSP) is an advanced graphical data mining, reporting, modeling, and analysis environment built on top of the best-of-breed open source projects. [...]]]></description>
			<content:encoded><![CDATA[<p>On Friday we released <a href="http://decisionstudio.com/product" target="_blank"><strong>DecisionStudio Professional</strong></a> &#8211; a comprehensive and free <strong>desktop BI Platform </strong>that gives you all the tools needed for analytics under a single package licensed under <strong>GNU Public License (GPL).</strong></p>
<p>DecisionStudio Professional (DSP) is an advanced <strong>graphical data mining, reporting, modeling, and analysis environment </strong>built on top of the best-of-breed open source projects.  Some of these include:<br />
      &#8212;  <strong>Optimized MySQL database </strong>as data warehouse platform<br />
      &#8212;  <strong>SQL Workbench</strong> (MySQL Query Browser and DBDesigner) for Data Analysts<br />
      &#8212;  <strong>R environment </strong>for statistical analysis and modeling<br />
      &#8212;  <strong>iReport </strong>Reporting GUI and <strong>JasperReport </strong>reporting library<br />
      &#8212;  <strong>Python </strong>with <strong>Boa Constructor IDE </strong>for application and GUI development</p>
<p>DecisionStudio Professional is the only <strong>end-to-end open source analytics platform </strong>that provides comprehensive capabilities to each role.  Data Analysts get to store, process, and publish data on a standard MySQL platform; Reporting Analysts would like iReport and the integration with Office tools; and Modelers would love the excellent R Environment.  It also includes Python along with a drag-n-drop GUI building environment for analytics Application Developers.</p>
<p>You can <a href="http://decisionstudio.com/product" target="_blank"><strong>find out more about DecisionStudio Professional at decisionstudio.com</strong></a>, and can <a href="https://sourceforge.net/projects/ds-professional" target="_blank"><strong>download your copy at Sourceforge.net</strong></a>.   <a href="http://decisionstudio.com/site/wp-content/decisionstudio-professional.pdf" target="_blank">Click here to download the product brochure (PDF).</a> </p>
<p>Go ahead, it&#8217;s completely free and will always stay so.  <img src='http://opensourceanalytics.com/wordpress/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
	</channel>
</rss>
