Tom's Hardware Forums » Storage » Hard Disks » 15 Years Of Hard Drive History: Capacities Outran Performance
 

15 Years Of Hard Drive History: Capacities Outran Performance

Add a reply



 Word :   Username :  
 
Bottom
Author
 Thread : 15 Years Of Hard Drive History: Capacities Outran Performance
 
Profile: Tom's Hardware Team
More Information

We've got quad-core processors, gigabytes of RAM and awesome graphics. But what has been going on with hard drives during all the time that these goodies have been developed? Capacities have multiplied and will hit one terabyte per drive soon, but as we'll see, performance hasn't been able to keep up the pace.

Related Pr oduct
Register or log in to remove.

Profile: newbie
More Information

Thanks for the great HDD history lesson, THG. You got your gigs and megs messed up, but hey..... 8O

C’est magnifique, mais ce n’est pas la guerre.
Profile: Forum Master
More Information

Hard Disk drives can only get so fast. I'm pretty sure that the top of the line SCSI's out there have topped out the RPM that you can spin the platters and still be able to read/write effectively.

Profile: stranger
More Information

Can the editors proof read the article and change all the GB to MB for the 40MB drive? There are two obvious mistake (one being the headline introduction for the drive).

Recently Tom's hardware articles seem like they are rushed online...

Profile: journeyman
More Information

I agree, totally annoying. When I first read the article I actually thought it was a 40Gb drive that propably was used by NASA and costed about 20000$ or something like that. The difference between Gb and Mb is tousandfold so when I read something about it I shouldn´t think what it is.

Profile: stranger
More Information

I was an intern for maxtor about a year ago, and was going to work for them before they anounced being bought by seagate and then my job there dried up (right when I was graduating with my BSEE).


I can tell you there is a reason for the change in performance since the old hard drives. It's all due to density. You'd think that the more bits/second you pass by the head the faster you could read right? So higher density would give you better performance... but here's the issue... they dont write 1's and 0's to hard drives anymore. They USED to write 1's and 0's, using an up or down magnetically oriented field and then using peak detection (current caused in the head by the magnetic field is a negative deriviative, so when it just passes by the head you get a spike in voltage as the field on the disk is DC and the only AC part of the effect is beacuse it's moving past the head). Now they use what is called PRML and are switching to another form of a similar thing, it's rather complicated, so let me start explaining......

PRML: This is a genious idea by HD guys to read data off a disk that is smashed too close together, but it causes lots of performance hits. When density started increasing on hard drives, they had issues with the fields of adjacent bits interfiering with eachother, so if you wrote 111 you'd be fine, but if you wrote 101 you wont get a nice separated +1 spike -1 spike +1 spike, you might get +.5 spike -.3 spike +.5 spike (spikes are no longer large enough to register under peak detection). So they said... hey we have faster and faster channel chips (main big chip on the hard drive that controlls the reading/writing for the most part). So... why not over sample the signal? Like refreshing your computer screen, but because of the mathmatical principle called nyquist rate, you have to sample AT LEAST 2x the frequency that the bits are bassing under the head. So you over sample the signal, and to lock the sampling in you use a phase lock loop at the start of each sector..... then you over sample the data, and it looks like the 1's and 0's are interfearing... now here's the part that takes ALOT of processing power... You then go through what is called a Vitirbi (sp?) tree. It's a logic tree that changes what branch it branches to based on the MSE (mean squared error, a probability based number that tells you your error of the read value). This logic tree looks at the interference of the 1's and 0's (which have been written to the hard drive messed up because that's the only way you can do it). Then it looks at how much "error" is in the interference VS a randomly generated bit patter, then goes through the tree until the MSE is as small as possible, and hands over that value as the "data". AKA it guessed at your data using a fairly mathmatically sound method... works most of the time, cuz your hard drive works right? Yep... but that takes alot of processing power and time... and lots of floating point or Q based calculations...

the 620 gig drive maxtor should have out by now had a data frequency of 1 gigabyte/second at the outside of the disk, and the channel ran at 2 gigahurtz (to meet nyquist rate) (dont worry 0 point sampling is prevented by the PLL). The channel almost had more transistors in it than the processors in computers at that point in time (when I was working on it).

The next problem is this: When your data is packed so close together and your data's magnetic field is so small, it's easy to accidentally effect it, the higher the density, the higher this problem is. When you read an adjacent track on the hard drive, or especially write, you can damage the tracks next to it, so before a write to a track you have to read the adjacent tracks, then do your write, then re-write the adjacent tracks as well, and the higher your density the more you have to do this... causing more performance issues. (usually the read degradation is small and they just ignor it). There is ECC (error correcting code) in each sector that can correct a large data vault (I cant really tell you this number as it's a trade secret for each company, but I can tell you that it's upwards of half the bits in the sector).

Some of the hard drives have changed from using PRML to a new "changed" version of PRML, it's different in that instead of using a vitirbi tree it uses a node based forward and back propagation method, with propagation to nodes determined by a modified MSE, I cant give out too much info on it... but it's supposed to be more accurate, and at times can take less time to calculate, but it can also take longer, depends on how much effort you want on trying to get your "scambled" data out. Not to mention that the data is actually scambled when it's writtin on the disk, the data is multiplied by a psuedo random value (mathmatically chosen to prevent large number of adjacent 1's, or 0's, like 111111111, etc) and then unscrambled when read.


The good news: New recording methods can improve performance, bit density (hd size) and reduce processing overhead.... welcome to perpandicular recording! This technology has been worked on for years, and seagate finaly (notice AFTER they merged with maxtor they released this)... hehehe... I'm not bias, I swear... anyways... Normally data is written in a horizontal fashion on the disk, HOWEVER this new method alows the data to be written in a virtical fashion, so the field is oriented virtically through the disk. The center of the disk is a material that allows magnetic field flux through it, so you can still write to both sides of the disk... Because of the way the field fluxes work out, you no longer have as much bit interference, and you can back more data together... and get this.... When you write a 1 you might write (+1 for say field pointing up and -1 for field pointing down) +1+1+1+1+1, and for 0 -1-1-1-1-1... when each time the head passes over the series of field chunks, you get a spike... and if you filter all those spikes together... you get a nice smooth signal... welcome back to 1's and 0's!! So with perpandicular recording you get increased disk size, back to writting 1's and 0's and better data integrity, with increased performance due to less overhead (no PRML). YAY!

You guys should have tested one of seagates perpandicular recording drives in the line up..... i'd like to see what it will do, and look forward to looking forward to what they can do in the future.

Never think that technology has reached it's end... humans always somehow come up with a new way of getting what they want done... for what ever reason they want it.

Factboy
Profile: Ancient Poster
More Information

Wow, thanks for all that information, fascinating! I never realized how non-perpendicular (specifically PRML) hard drives worked!

Profile: stranger
More Information

It would also be interesting to have some information here about trends in the durability of harddrives over the years; newer drives seem to have far shorter lifespans than 'old' ones - my old Quantum fireball 6.2 is still going strong but I've burnt through endless modern Maxtor drives of far greater capacity over the same period. I suppose it's hard to determine whether this is the result of the higher stresses on the newer drives imposed by their higher performance or just that there's more money to be made by selling high-capacity drives that fail after a couple of years but are cheaply-replaced...

Profile: newbie
More Information

I’ve been telling people this for years.

Everyone talks about CPU and GPU power and the latest high speed ram. No one ever talks about hard drives. So I'll say it again.

Instead of going out and buying that 500 gig western Digital... Go buy 2 250 gig drives and raid stripe them.

I now have 3 sets of of Western Digital raptor drives:

2 X 36
2 X 72
2 X 150 (sata2)

I have 7 systems at home and no matter what I put in a box, running 2 drives in stripping has by far the greatest impact to perf.


PS. I remember buying that Maxtor 40 meg hard drive for my old 386 16 and it was a beast!!!!

What does this do.......
Profile: Honorary Poster
More Information

A couple of things need to be taken into account.

First of all, the rate of data transfer depends on the mechanism for data transferral. I am sure that they could develop a way to have several bitstreams running concurrently to teh disk and have several heads reading/writing on the same disk, same side/etc, but I am sure that with each bitstream you use, the management of them gets increasingly complicated.

Also, if you use a parking lot analogy yuo can also see how one thing can outpace the other. Initially we were all driving bombers and could only fit a certain ammount of cars in there, but as time went by, we changed cars to smaller, and optimized the parking layout. We put in an extra attendant and people were able to get in and out faster.

Now later, you put in a parking garage, add another attendant and go from there. You are not fitting 10 times the people in there, but your attendants only went from 2 to 3, so your performance was trounced by your storage capacity. Add to the mix that now, although we have faster attendents, with quicker cash registers, we are parking our cars vertically. I wonder if there will be another ratio lag between performance and storage or if, as teh resident techie here informed us, that we actually catch up (all the cars parking there get E-Z Pass... ;) )

The analogy can also be extended to water storage and pump capacity and a number of different associations, but the end result is the same. Increasing capacity is easier than increasing the performance on a physical read-write drive.



I am just wondering what is happening with flash-drives and whether that teoretical 100G will ever be possible within a CF cards footprint. If that is the case, parallel read/write for dozens of these cards may be the next step in mass storage on our machines.

Imagine having 20 of them arranged in your machine in a RAID array! 1 goes bad, an LED indicates which one, you replace that one alone!

Of course, for this to become a reality, the prices will have to drop significantly, but I am anxious to see how this all plays out in the next 10 years...

Profile: enthusiast
More Information

Quote :

I’ve been telling people this for years.

Everyone talks about CPU and GPU power and the latest high speed ram. No one ever talks about hard drives. So I'll say it again.

Instead of going out and buying that 500 gig western Digital... Go buy 2 250 gig drives and raid stripe them.

I now have 3 sets of of Western Digital raptor drives:

2 X 36
2 X 72
2 X 150 (sata2)

I have 7 systems at home and no matter what I put in a box, running 2 drives in stripping has by far the greatest impact to perf.


PS. I remember buying that Maxtor 40 meg hard drive for my old 386 16 and it was a beast!!!!



Maybe you edit video ? RAID will not improve typical desktop performance that_much. I have 4xRAID0 running on my system here, and other than for seriously HDD intensive tasks, it offers no speed increase. Mostly I only see improvements when loading saved games, or game levels, editing Video, and extracting large archive files (oh, and seeing Sandra bench 4X barracudas(ATA 100) at 132MB/s, and 7 ms random access times is neat too). Of course, if you want to shave some time off your OS boot times, RAID0 can also come in handy, but unless you're running 0+1, or perhaps RAID10, you're playing with fire. *shrug*

Don't get me wrong, I'm a serious hardware buff, and love the idea of any type of RAID, but unfortunately, except for the occasional desktop application, it doesn't really offer much of a performance boost. I've even noticed that putting my swap file on a 4x RAID 0 array made very little(if any) impact on overall system performance. That is, a noticeable performance difference, of course.

All this being said, I still want to get a good Areca RAID controller, and run a serious 4-8x RAID0 Array (before swapping to RAID 5, or 6) just to see how fast this array would perform. Call it the hardware "kid" in me, that still keeps trying to eek out every last possible ounce of performance . . .

P.S. I also had that 40MB HDD in my 386 sx 25 (1993), and still have the drive that replaced it, a 80MB Maxtor, and it still works !

Profile: stranger
More Information

Please oh please add the noise/audible level of hard drives to your hard drive chart. That would be very helpful.

I care more about noise level than getting that 1/1000th performance increase over another hard drive.

Profile: newbie
More Information

Quote :


Maybe you edit video ? RAID will not improve typical desktop performance that_much. I have 4xRAID0 running on my system here, and other than for seriously HDD intensive tasks, it offers no speed increase. Mostly I only see improvements when loading saved games, or game levels, editing Video, and extracting large archive files (oh, and seeing Sandra bench 4X barracudas(ATA 100) at 132MB/s, and 7 ms random access times is neat too). Of course, if you want to shave some time off your OS boot times, RAID0 can also come in handy, but unless you're running 0+1, or perhaps RAID10, you're playing with fire. *shrug*

Don't get me wrong, I'm a serious hardware buff, and love the idea of any type of RAID, but unfortunately, except for the occasional desktop application, it doesn't really offer much of a performance boost. I've even noticed that putting my swap file on a 4x RAID 0 array made very little(if any) impact on overall system performance. That is, a noticeable performance difference, of course.

All this being said, I still want to get a good Areca RAID controller, and run a serious 4-8x RAID0 Array (before swapping to RAID 5, or 6) just to see how fast this array would perform. Call it the hardware "kid" in me, that still keeps trying to eek out every last possible ounce of performance . . .

P.S. I also had that 40MB HDD in my 386 sx 25 (1993), and still have the drive that replaced it, a 80MB Maxtor, and it still works !



I have a lot of computers (at home, I manage 250 at work) right now that are pretty much the same spec such as:

P4 2.6 (533)
p4 2.8 Prescott
p4 3.0 Prescott
p4 3.0 Prescott
p4 3.4 Prescott
AMD 3500+
AMD 4400+ X2
Core 2 Duo 2.6


All with similar hardware (same Asus MB, generic Kingston ram and such).

One may have the raid, one may have say a 200 gig Western digital Sata drive.

The difference in doing any task in Windows is greatly improved. Simple things such as opening Itunes (with 60 gigs of MP3's) is significantly improved. I actually find it hard to use the non-raid PC's since the delays the HD cause are noticeable.


Real life example
When I backup the 60 gigs in mp3's from one to another, I can fire up WOW on the AMD 4400+ with absolutely no hard drive lag (hell, I cant even tell its crunching the data). On the Core 2 duo with 4 gigs or ram but only a 250 gig sata drive, the PC is severely lagged and cannot be used for much until its done. This applies to all of my PC's. The raid drives clearly perform much better.


I think one does not really see the benefit until you sit them next to each other and just use the PC's.

Profile: stranger
More Information

WTF!?!?

Where the HELL are my SCSI drives?!!?

Profile: newbie
More Information

Quote :

WTF!?!?

Where the HELL are my SCSI drives?!!?



SCSI only good on servers foo!!! :)

Profile: stranger
More Information

Quote :

SCSI only good on servers foo!!!



lol.. don't be silly, I'm using the Seagate 15k.5 73gb drive in my GAMING computer.

Profile: newbie
More Information

Quote :

SCSI only good on servers foo!!!



lol.. don't be silly, I'm using the Seagate 15k.5 73gb drive in my GAMING computer.

SCSI or Sas?

Profile: stranger
More Information

Quote :

SCSI or Sas?



SCSI.

Profile: newbie
More Information

Quote :

SCSI or Sas?



SCSI.

Well, for the same price (or less with the SCSI card) you could have probably bought 2 Western digital 150gig Sata2 rapters and blow that drive away in striping.

Scsi is great on my servers, I would never run it at home.

I just rolled out a Dell Poweredge NAS with Storage server X64 enterprise. The thing has 8 300 gig SAS drives in Raid 0+1 (1 terrabye of data). The thing smokes but I still wouldnt put those drives in on my gamming PC. Im not running DFS at home and 250 users dont use it to file server :)

Oh, ok.
Profile: enthusiast
More Information