Datasets which are identical over a number of statistical properties, yet produce dissimilar graphs, are frequently used to illustrate the importance of graphical representations when exploring data. This paper presents a novel method for generating such datasets, along with several examples. Our technique varies from previous approaches in that new datasets are iteratively generated from a seed dataset through random perturbations of individual data points, and can be directed towards a desired outcome through a simulated annealing optimization strategy. Our method has the benefit of being agnostic to the particular statistical properties that are to remain constant between the datasets, and allows for control over the graphical appearance of resulting output.
"Thanks a million for your help with refining my problem statement a few weeks ago. I followed your advice, and my resubmitted proposal passed IRB approval. I was most impressed with your time efficiency. You were able to, in a matter of minutes, extract the most important aspects of my existing problem statement, add verbiage that sorely needed emphasizing, and relate it beautifully to my overall purpose statement... nothing short of brilliant on your part, Dr. Lani. I even got positive accolade on my revised problem statement from the ARB reviewer! I will definitely be in touch once my data collection is complete and Chapter 4 is ready for construction."