<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Random Test Data Generation</title>
	<atom:link href="http://www.testingmentor.com/imtesty/2009/11/12/random-test-data-generation/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.testingmentor.com/imtesty/2009/11/12/random-test-data-generation/</link>
	<description>Treatises on the practice of software testing</description>
	<lastBuildDate>Tue, 07 Feb 2012 09:15:11 -0500</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
	<item>
		<title>By: testingmentor</title>
		<link>http://www.testingmentor.com/imtesty/2009/11/12/random-test-data-generation/comment-page-1/#comment-104</link>
		<dc:creator>testingmentor</dc:creator>
		<pubDate>Fri, 13 Nov 2009 02:26:32 +0000</pubDate>
		<guid isPermaLink="false">http://testingmentor.com/imtesty/2009/11/12/random-test-data-generation/#comment-104</guid>
		<description>Hi Michael,

Since I explicitly stated &quot;Using only the characters &#039;A&#039; - &#039;Z&#039;...for a filename with an 8-letter filename and a 3-letter extension...&quot; then you are not making an assumption. I intentionally limited the initial domain space as an introductory example using information I thought most readers would be familiar with (and I assumed readers would tacitly understand the number of characters and filename/extension lengths is far greater on modern operating systems). The problem space is actually much larger when taking into consideration variable filename and extension lengths, and the Unicode character repertoire as explained above and in the article.

So, within these constraints I used 268 + 263 because the base filename and the filename extension are 2 separate components and are handled differently by the Windows operating system file system. (Note that I also explicitly stated the testing would be conducted on a &quot;Windows platform.&quot;)

If for some reason we suspected there is interaction or no distinction between the filename component and the extension component (as on a Unix platform) then we would have to test each unique 8-letter filename with each unique 3-letter extension which would equal 2611. 

However, since I stated the test environment is Windows, our in-depth system knowledge of the Windows OS file system tells us the extension is handled as a separate namespace or component from the base filename component. (For example, we cannot save a filename as CON.txt, but we can save a filename as myFile.CON.) Thus, we know that the base filename component and the extension component of the filename are independent, so testing all combinations on a Windows OS would result in approximately 3,670,135,659,905,624 redundant tests (if we could do exhaustive testing).

One could argue that with nominal valid data we could combine the 17576 (263) unique 3-character extension combinations with combinations of the 8 character base filename components to effectively reduce the overall number of tests.

However, I choose to separate testing the extensions and test each extension combination as a unique component independent of the base filename component. So, by using a nominal (known good) base filename component, we would simply add the additonal 17576 tests for each unique 3-letter extension combination. (Thus, 268 + 263.)

(But, I can understand how someone could misconstrue my statement &quot;...the total number of possible character combinations...&quot; when taken out of context from the rest of the paragraph, and that sentence could have been worded more precisely to stand on its own.)

ADDENDUM:

(Of course, the math experts will immediately realize that 268 + 263 does not equal 208,827,099,728. It actually equals 208,827,082,152. It seems that somebody&#039;s fingers got a little happy with the M+ button on the calculator when running the initial numbers.

But, the purpose of the article is to examine the use and value of intelligent random test data in testing; not to debate or over-analyze the number of filename combinations on Windows or Unix platforms. So, I hope the readers understand the point that the number of combinations is quite large (+/- 17576) and utilizing intelligent random data (for positive or negative testing) in conjunction with &#039;real-world&#039; and known problematic static data is a good way to add variablity to a test and expand coverage of potential possibilities since we know we cannot test all probable combinations for a given platform.)

Thursday, May 31, 2007 5:24 AM by I.M.Testy</description>
		<content:encoded><![CDATA[<p>Hi Michael,</p>
<p>Since I explicitly stated &#8220;Using only the characters &#8216;A&#8217; &#8211; &#8216;Z&#8217;&#8230;for a filename with an 8-letter filename and a 3-letter extension&#8230;&#8221; then you are not making an assumption. I intentionally limited the initial domain space as an introductory example using information I thought most readers would be familiar with (and I assumed readers would tacitly understand the number of characters and filename/extension lengths is far greater on modern operating systems). The problem space is actually much larger when taking into consideration variable filename and extension lengths, and the Unicode character repertoire as explained above and in the article.</p>
<p>So, within these constraints I used 268 + 263 because the base filename and the filename extension are 2 separate components and are handled differently by the Windows operating system file system. (Note that I also explicitly stated the testing would be conducted on a &#8220;Windows platform.&#8221;)</p>
<p>If for some reason we suspected there is interaction or no distinction between the filename component and the extension component (as on a Unix platform) then we would have to test each unique 8-letter filename with each unique 3-letter extension which would equal 2611. </p>
<p>However, since I stated the test environment is Windows, our in-depth system knowledge of the Windows OS file system tells us the extension is handled as a separate namespace or component from the base filename component. (For example, we cannot save a filename as CON.txt, but we can save a filename as myFile.CON.) Thus, we know that the base filename component and the extension component of the filename are independent, so testing all combinations on a Windows OS would result in approximately 3,670,135,659,905,624 redundant tests (if we could do exhaustive testing).</p>
<p>One could argue that with nominal valid data we could combine the 17576 (263) unique 3-character extension combinations with combinations of the 8 character base filename components to effectively reduce the overall number of tests.</p>
<p>However, I choose to separate testing the extensions and test each extension combination as a unique component independent of the base filename component. So, by using a nominal (known good) base filename component, we would simply add the additonal 17576 tests for each unique 3-letter extension combination. (Thus, 268 + 263.)</p>
<p>(But, I can understand how someone could misconstrue my statement &#8220;&#8230;the total number of possible character combinations&#8230;&#8221; when taken out of context from the rest of the paragraph, and that sentence could have been worded more precisely to stand on its own.)</p>
<p>ADDENDUM:</p>
<p>(Of course, the math experts will immediately realize that 268 + 263 does not equal 208,827,099,728. It actually equals 208,827,082,152. It seems that somebody&#8217;s fingers got a little happy with the M+ button on the calculator when running the initial numbers.</p>
<p>But, the purpose of the article is to examine the use and value of intelligent random test data in testing; not to debate or over-analyze the number of filename combinations on Windows or Unix platforms. So, I hope the readers understand the point that the number of combinations is quite large (+/- 17576) and utilizing intelligent random data (for positive or negative testing) in conjunction with &#8216;real-world&#8217; and known problematic static data is a good way to add variablity to a test and expand coverage of potential possibilities since we know we cannot test all probable combinations for a given platform.)</p>
<p>Thursday, May 31, 2007 5:24 AM by I.M.Testy</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael A Bolton</title>
		<link>http://www.testingmentor.com/imtesty/2009/11/12/random-test-data-generation/comment-page-1/#comment-103</link>
		<dc:creator>Michael A Bolton</dc:creator>
		<pubDate>Fri, 13 Nov 2009 02:26:20 +0000</pubDate>
		<guid isPermaLink="false">http://testingmentor.com/imtesty/2009/11/12/random-test-data-generation/#comment-103</guid>
		<description>Could you help me understand why the figure is 26**8 PLUS 26**3, instead of 26**8 TIMES 26**3, or simply 26**11?  

As I believe you are, I&#039;m assuming that the filenames must be exactly eight alpha uppercase characters, and the extension must be exactly three uppercase alpha characters.  That would be 3,670,344,486,987,776 combinations, not 208,827,099,728.

---Michael B.

Thursday, May 31, 2007 12:30 AM by Michael A Bolton</description>
		<content:encoded><![CDATA[<p>Could you help me understand why the figure is 26**8 PLUS 26**3, instead of 26**8 TIMES 26**3, or simply 26**11?  </p>
<p>As I believe you are, I&#8217;m assuming that the filenames must be exactly eight alpha uppercase characters, and the extension must be exactly three uppercase alpha characters.  That would be 3,670,344,486,987,776 combinations, not 208,827,099,728.</p>
<p>&#8212;Michael B.</p>
<p>Thursday, May 31, 2007 12:30 AM by Michael A Bolton</p>
]]></content:encoded>
	</item>
</channel>
</rss>

