<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Open Source Analytics &#187; Modeling</title>
	<atom:link href="http://opensourceanalytics.com/category/modeling/feed/" rel="self" type="application/rss+xml" />
	<link>http://opensourceanalytics.com</link>
	<description>Comprehensive Analytics on Open Source Software.</description>
	<lastBuildDate>Tue, 25 Sep 2007 15:12:42 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.3</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Data Mining &#8211; Desktop Survival Guide</title>
		<link>http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/</link>
		<comments>http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/#comments</comments>
		<pubDate>Fri, 10 Mar 2006 08:19:38 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Modeling]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/</guid>
		<description><![CDATA[A good resource on Data Mining using R environment is &#8220;Data Mining &#8211; Desktop Survival Guide&#8221; by Graham Williams.  Its a well researched and frequently updated free online book.  Graham does a good job of going through the data mining process and provides detailed descriptions of the commonly used algorithms.  
Exmaples are [...]]]></description>
			<content:encoded><![CDATA[<p>A good resource on Data Mining using R environment is <a href="http://togaware.com/datamining/survivor/Contents.html">&#8220;Data Mining &#8211; Desktop Survival Guide&#8221; by Graham Williams</a>.  Its a well researched and frequently updated free online book.  Graham does a good job of going through the data mining process and provides detailed descriptions of the commonly used algorithms.  </p>
<p>Exmaples are in R (all with R source code, of course), with some good examples of graphical visualization as well.</p>
<p><a href="http://togaware.com/datamining/survivor/Contents.html">Click here for Data Mining &#8211; Desktop Survival Guide</a></p>
<p>R Environment is a part of <a href="http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/">DecisionStudio-Professional</a> and can also be separately downloaded from <a href="http://r-project.org">http://r-project.org</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/03/10/data-mining-desktop-survival-guide/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Econometrics on R Environment</title>
		<link>http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/</link>
		<comments>http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/#comments</comments>
		<pubDate>Mon, 06 Mar 2006 20:56:15 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[Open Source Analytics]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/</guid>
		<description><![CDATA[I would, in due course, spend some time on the R Environment (available as part of DecisionStudio Professional, and separately downloadable from http://r-project.org).  R provides an excellent alternative to commercial products for modeling, statistical analysis, and graphics.  Originally designed by AT&#038;T Bell Labs, the R environment is fast becoming the standard for cutting [...]]]></description>
			<content:encoded><![CDATA[<p>I would, in due course, spend some time on the R Environment (available as part of <a href="http://decisionstudio.com/product">DecisionStudio Professional</a>, and separately downloadable from <a href="http://r-project.org">http://r-project.org</a>).  R provides an excellent alternative to commercial products for modeling, statistical analysis, and graphics.  Originally designed by AT&#038;T Bell Labs, the R environment is fast becoming the standard for cutting edge number crunching.<br />
<span id="more-32"></span><br />
Bio-informatics is one area of dominance, and a lot of academic papers these days come with R based/compatible implementations.  (So you don&#8217;t have to wait for your commercial BI vendor to first discover that new technique, make up its mind about it, and then finally get a bunch of disinterested code-monkeys to write a new library for, luckily just in time for the release after next).</p>
<p>While R is used a lot for statistical research, its adoption for econometrics has been comparatively slower, possibly due to terminology differences.</p>
<p>An excellent text called <a href="http://cran.r-project.org/doc/contrib/Farnsworth-EconometricsInR.pdf">&#8220;Econometrics in R&#8221; by Grant V. Farnsworth (PDF)</a> provides a effective hands-on intro to the most common things you would find yourself doing including Time Series, regressions, Plotting, etc.  Have a look.</p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/03/07/econometrics-on-r-environment/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>DecisionStudio Professional &#8211; Desktop BI Platform</title>
		<link>http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/</link>
		<comments>http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/#comments</comments>
		<pubDate>Tue, 28 Feb 2006 17:39:34 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[BI, Data Mining, Analytics]]></category>
		<category><![CDATA[Data Warehousing]]></category>
		<category><![CDATA[DecisionStudio-Professional]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[On Your Own]]></category>
		<category><![CDATA[Open Source Analytics]]></category>
		<category><![CDATA[Reporting]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/</guid>
		<description><![CDATA[On Friday we released DecisionStudio Professional &#8211; a comprehensive and free desktop BI Platform that gives you all the tools needed for analytics under a single package licensed under GNU Public License (GPL).
DecisionStudio Professional (DSP) is an advanced graphical data mining, reporting, modeling, and analysis environment built on top of the best-of-breed open source projects. [...]]]></description>
			<content:encoded><![CDATA[<p>On Friday we released <a href="http://decisionstudio.com/product" target="_blank"><strong>DecisionStudio Professional</strong></a> &#8211; a comprehensive and free <strong>desktop BI Platform </strong>that gives you all the tools needed for analytics under a single package licensed under <strong>GNU Public License (GPL).</strong></p>
<p>DecisionStudio Professional (DSP) is an advanced <strong>graphical data mining, reporting, modeling, and analysis environment </strong>built on top of the best-of-breed open source projects.  Some of these include:<br />
      &#8212;  <strong>Optimized MySQL database </strong>as data warehouse platform<br />
      &#8212;  <strong>SQL Workbench</strong> (MySQL Query Browser and DBDesigner) for Data Analysts<br />
      &#8212;  <strong>R environment </strong>for statistical analysis and modeling<br />
      &#8212;  <strong>iReport </strong>Reporting GUI and <strong>JasperReport </strong>reporting library<br />
      &#8212;  <strong>Python </strong>with <strong>Boa Constructor IDE </strong>for application and GUI development</p>
<p>DecisionStudio Professional is the only <strong>end-to-end open source analytics platform </strong>that provides comprehensive capabilities to each role.  Data Analysts get to store, process, and publish data on a standard MySQL platform; Reporting Analysts would like iReport and the integration with Office tools; and Modelers would love the excellent R Environment.  It also includes Python along with a drag-n-drop GUI building environment for analytics Application Developers.</p>
<p>You can <a href="http://decisionstudio.com/product" target="_blank"><strong>find out more about DecisionStudio Professional at decisionstudio.com</strong></a>, and can <a href="https://sourceforge.net/projects/ds-professional" target="_blank"><strong>download your copy at Sourceforge.net</strong></a>.   <a href="http://decisionstudio.com/site/wp-content/decisionstudio-professional.pdf" target="_blank">Click here to download the product brochure (PDF).</a> </p>
<p>Go ahead, it&#8217;s completely free and will always stay so.  <img src='http://opensourceanalytics.com/wordpress/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2006/02/28/decisionstudio-professional-desktop-bi-platform/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Classification Trees (Decision Trees)</title>
		<link>http://opensourceanalytics.com/2005/12/06/classification-trees-decision-trees/</link>
		<comments>http://opensourceanalytics.com/2005/12/06/classification-trees-decision-trees/#comments</comments>
		<pubDate>Tue, 06 Dec 2005 16:53:58 +0000</pubDate>
		<dc:creator>Nishith</dc:creator>
				<category><![CDATA[Modeling]]></category>

		<guid isPermaLink="false">http://opensourceanalytics.com/?p=19</guid>
		<description><![CDATA[Decision trees are one of the most widely used and practical forms of machine learning and data mining. ]]></description>
			<content:encoded><![CDATA[<p>Decision trees are one of the most widely used and practical forms of machine learning and data mining. They have been widely researched and applied to a large variety of data mining problems.  (Decision trees are also known as Classification Trees or Regression Trees based on whether the classification is being done on real values or on categorical variables.)<br />
<img src="http://upload.wikimedia.org/wikipedia/en/f/ff/Decision_tree_model.png" alt="Decision Tree: Forecasting whether Golf will be played based on the Weather condition" height="340" width="460"/><br />
<span id="more-19"></span></p>
<p>Trees are used to predict the membership of new cases into existing classes based on some information.  As an example, the classification of a new loan applicant into risk-based categories can give a bank an idea of how the applicant is likely to perform on his/her loan.  The bank can use this information to approve/reject the loan, and/or to price the loan based on risk.  (Riskier applicants get more expensive loans).</p>
<p>A simple Decision Tree to predict attendance at a Golf CLub based on the weather can be found at <a href="http://en.wikipedia.org/wiki/Decision_tree">Wikipedia::Decision Trees</a> </p>
<p>Decision tree models are built by a process that is known as recursive partitioning.  <a href="http://wekadocs.com/node/2">Click here </a>for a more detailed explanation of the process.</p>
]]></content:encoded>
			<wfw:commentRss>http://opensourceanalytics.com/2005/12/06/classification-trees-decision-trees/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
