<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
span.EmailStyle17
        {mso-style-type:personal-compose;
        font-family:"Calibri","sans-serif";
        color:windowtext;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page Section1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
        {page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal>I have a bunch of compute servers. They all have local
disks mounted as /scratch to use for computation scratch space. This
ensures maximum performance on all systems, and no competition for a shared
resource during crunch time. At present, all of their /scratch
directories are local, separate and distinct. I think it would be awesome
if /scratch looked the same on all systems. Does anyone know of a way to “unify”
this storage, without compromising performance? Of course, if some files
reside on server A, and they are requested from server B, then the files must
go across the network, but I don’t want the files to go across the
network unless they are requested. And yet, if you do something like “ls
/scratch” you would ideally get the same results regardless of which
machine you’re on.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Due to the nature of heavy runtime IO (read, seek, write,
repeat…) it’s not well suited to NFS or any network filesystem…
Due to the nature of many systems all doing the same thing at the same time, it’s
not well suited to a SAN using shared disks… <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>I looked at gfs (the cluster filesystem) – but –
it seems gfs assumes a shared disk (like a san) in which case there is
competition for a shared resource.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>I looked at gfs (the google filesystem) – but –
it seems they constantly push all the data across the network, which is good
for redundancy and mostly-just-read operations, and not good for heavy
computation IO.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Not sure what else I should look at. Any ideas?<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>TIA.<o:p></o:p></p>
</div>
</body>
</html>