Previous Thread
Next Thread
Print Thread
Fastest A-Shell Hardware? #27321 10 May 07 08:10 AM
Joined: Jun 2001
Posts: 11,925
J
Jack McGregor Online Content OP
Member
OP Online Content
Member
J
Joined: Jun 2001
Posts: 11,925
An A-Shell developer is working on a big new project and has asked my advice about the fastest kind of computer to run an A-Shell application. This will most likely be an application with lots of users, and very LARGE files. I don't have enough experience with different hardware environments to be able to give a definitive answer (and doubt that anyone could, in fact, do that). But in any case, all suggestions or observations about performance-maximization would be most appreciated, including, but not limited, to the following topics:

1. Brand names (IBM, Dell, HP, etc.) and models.

2. OS (most likely Linux) and distribution.

3. CPU type: Intel vs AMD. Multi core, multi CPU?

4. Disk technology and configuration.

For my part, a mixture of incidental and anecdotal experiences, and some theoretical knowledge have led me to the following prejudices:

A. Linux is far faster than Windows. (I'm less sure about how Linux compares with AIX, although our largest systems in the field - over 500 users - do happen to all be AIX.)

B. Dual CPUs are better than a single dual-core CPU. This is logical because A-Shell is single-threaded. Two A-Shell instances can each run on one of the separate CPUs, but I don't think they can share the cores of a single CPU. Dual core may have other benefits though - certainly for OS kernel code, and nowadays are practically standard. 64 bit CPUs do not offer any significant advantage for A-Shell.

C. SCSI or SAS is faster than SATA, particularly when their are queued requests, although perhaps this difference is minimized in very high-end RAID environments. SCSI is also more expensive, but the drives seem to have fewer failures. 15K RPM drives are significantly faster than 10K, since that reduces the average latency (involved in every random access) from 6ms to 4ms. With average seek times in the range of 7ms, those extra 2ms are significant.

D. A-Shell apps are almost always disk-bound before they are CPU-bound. Thus, money should be directed towards the fastest disks, rather than just the highest CPU MHz (which, as you'll notice, is how inexpensive systems are always advertised).

E. In most cases, simple mirroring (RAID 1) is faster than striping (RAID 0, 5, etc), because a smart controller can seek independently from either physical disk. Having multiple independent disks (or mirrors) is better than a single disk subsystem, for the same reason. Put the OS on one disk (or mirror unit), the app on another, or perhaps split the most commonly accessed files in the app on to different physical disks.

F. Application configuration and design may make as much of a difference as hardware, and is difficult to give general advice about. But here are a couple of pointers anyway:

- Don't put too many files in one directory. After several hundred, performance really starts to suffer. Use more directories instead. I see this all the time, where people create sequentially-named output files by the thousands and just put them all in one big directory with the rest of the application files. And then wonder why performance tanks.

- For large files, ISAM is much better than binary search schemes like SERCH.SBR. (A 64K record lookup will take 16 disk accesses in SERCH.SBR, and probably only 4 in ISAM.) An exception might be if you can memory-map the index file, since memory-mapped i/o is so much faster than ordinary i/o.

Re: Fastest A-Shell Hardware? #27322 10 May 07 08:39 AM
Joined: Sep 2002
Posts: 5,486
F
Frank Online Content
Member
Online Content
Member
F
Joined: Sep 2002
Posts: 5,486
Jack, it's hard to add to your already detailed and informative post... but from our experience i can concur on all the above. Here are just a few more morsels:

1) We are a dedicated linux shoppe, and it is extremely fast and robust.. but does bog down when using the main console.. so don't throw any activity there, as all will suffer.

2) I didnt hear you say RAM... I would definately splurge on the extra gb of ram, and go with at least 2gb here... Linux caches very well, and often repeat disk requests dont even hit the hard drive. (ashell also caches well, if you have all the tweaks setup in miame.ini)

3) I also very much agree with separate OS drive and data drive.. for the $$, this is not very expensive.... and for a large data set or volume of users, multiple drives with data spread across helps to. (we don't raid per'se, but rather move files manually across drives for optimum thruput)

4) We have some 100+ users boxes in the field, most with Intel+SCSI, but we have moved recently to AMD dual core (3800 and 4200's) + SATA2... honestly.. these boxes are really fast out of the box... the problem we have had with SCSI is getting the right &!@#$ drivers to function with linux, so we have moved to SATA for comptabily... but are recently having problems adding more than 3 SATA's to a single AMD box... (always something). The largest utility on the largest client processing like 10,000,000 reads takes less than 5 minutes... We have thought about going directly to DELL to have them configure a dual cpu box with multipile SCSI drives and linux pre-loaded but havent gone that route yet.

I think thats all from the far side...

Re: Fastest A-Shell Hardware? #27323 10 May 07 08:51 AM
Joined: Mar 2005
Posts: 494
Ty Griffin Offline
Member
Offline
Member
Joined: Mar 2005
Posts: 494
Anybody replying to this topic: please follow Jack and Frank's examples, and divide the topics into descrete paragraphs that talk only about that particular topic. I'm going to try to assemble all these comments into something coherent and put it into our formal documentations. Thanks.

Re: Fastest A-Shell Hardware? #27324 10 May 07 09:48 AM
Joined: Feb 2002
Posts: 94
T
Tom Jeske Offline
Member
Offline
Member
T
Joined: Feb 2002
Posts: 94
15K drives run much hotter than 10K drives, therefore the enclosure must have positive disk drive ventilation.

I've recently started using RAID 10 for my high performace installs. This is a mirror of striped drives. I was very surprised when I first benchmarked this configuration. This was MUCH faster than a RAID 5 and 10% faster than a RAID 1 for writes and almost 20% faster for reads. This does require 4 drives to net 2 but it really made a big difference.


Moderated by  Jack McGregor, Ty Griffin 

Powered by UBB.threads™ PHP Forum Software 7.7.3