Greg
Greg's diary notes
April 2005
Greg's diary
Greg's photos
Greg's home page
Greg's links
Google

Monday, 4 April 2005

I've spent a lot of time today looking at hardware.  Apart from the
usual Fedora CD-R read errors, I've established:

1.  ECC memory works better when it's turned on.
2.  The memory you lent me (ex opetrix) hits about 1 ECC error a
    second while running memtest.  I'll bring a CD with me to our next
    meeting.

Discussing this on the efnet #BSDcode list was interesting:

* groOgle removes 4 GB memory from obelix and burns his fingers.
<groOgle> Boy, these mammary chips run hot!
<Darius> weird
<Darius> mine are nice and cool
<kirma> yawn
<groOgle> Darius: How much memory?
<ket> CC and C3 runs hot because it's got poorer electrical qualities
<ket> duh
<ket> have to ram the signal through basically
<Darius> groOgle: 2x256Mb
<Darius> although the eflux from the CPU heatsink washes right over them
<groOgle> Darius: I suppose 8 times the memory generates about 8 times
          the heat.
<Darius> heh true
<Darius> you have 2Gb DIMMs?
<groOgle> Yup.
<groOgle> I can't get the bloody machine to run with 8 GB, though.
<groOgle> And that's what I'm testing.
<ket> 2GBs run goddamn hot, they're all CC or C3
<ket> 2.6 or 2.7?
<Darius> what's a CC/C3?
* groOgle reboots tomato.
<groOgle> ket: Hold on, photo coming.
<ket> market grading on the TSOPs
<groOgle> http://www.lemis.com/grog/Photos-20050404.html#RAM-1
<ket> CC/C3 is basically 'meh, it isn't defective, hopefully.' pretty
     much lowest level of testing pass, possibly not tested higher but
     usually rejected
<groOgle> ket: So does the photo tell you anything.
<ket> samsung doesn't reject parts they know won't pass, doesn't test
      parts they know won't pass.  same for micron.  all 2gb dimms are
      cc/c3.
<ket> patience grasshoppa
<ket> especially since your url is 404
<groOgle> Damn.
<groOgle> The photos are there, just the main page is missing.
<ket> heh
<ket> gonna fix/ ;P
<groOgle> Hold on, any day now.
<groOgle> OK.
<ket> WTF?!
<ket> BGA?$!@&*(!@$!@?!
<ket> what.  THE.  *FUCK*!
<Darius> ooh sexa
<ket> NOT sexy
<Darius> looks like a routing nightmare :)
<ket> 4SD11...  that's odd
<ket> in fact, that's not a valid micron part number...
<Darius> why would they be micron chips?
<ket> bottom corner has MT
<ket> micron log
<ket> er micron logo.
<Darius> OK
<ket> wintec doesn't private-label or private-source.
<Tekrad> micron has a custom labeling system....
<ket> yeah.  wintec doesn't use it.
<ket> and regardless i should be able to get a hit on partial
<ket> going to have to try parametric
<ket> micron does NOT make a lot of BGA parts.
<ket> they're doublesided?
<ket> that's..  18 per side for a total of 36 per dimm...  which is
      technically an invalid ecc configuration
<ket> and they'd have to be 64MB depth which micron doesn't make
<ket> not in a PC3200 BGA
* groOgle watches in amazement as memtest spews forth one ECC error
          after another.
<ket> i'm not surprised.
<ket> memtest can't do ecc worth shit
<ket> but...  those parts are..  well, theyr'e fucking suspicious at minimum.
<ket> the BGA layout is weird and a half.  4+5 flipped?
<ket> and with micron not making a 64MB PC3200 part...  it's hard to
      say if those are just outright forgeries from china or if
      they're overclocked.
<ket> at most it can be a 128MB with that many, and their smallest
      PC3200 BGA is a 256MB
* groOgle listens with interest.
<ket> so, most likely you're looking at overclocked memory, which is a
      big no-no in ecc.
<ket> take it off SPD and try PC2700
<Darius> you should have got quality RAM like my GEIL ;)
<ket> geil is overclocked outright lying trash  
<ket> well now, this is odd.
<Darius> but at least it works
<groOgle> ket: This memory has an advantage that outweighs everything
          you've said so far.
<ket> groOgle - what's the voltage on it? 2.6V?
<groOgle> ket: How can I tell?
<groOgle> THe MB is set to "auto", and doesn't even tell me what that means.
<ket> oh you have gotta be shitting me...
<ket> in the bios should be a setting for dimm voltage
<groOgle> i.e.  how man V.
<groOgle> Sure there is.
<ket> oh christ.
<groOgle> But there's an "auto" setting.
<ket> fuck you msi.  retards.
<ket> auto means 'read spd'
<groOgle> And since I don't have any info, that's what I'm using.
<ket> ...  DUH!
<ket> groOgle - memtest reads SPD
<ket> i need the SPDEEPROM output
<ket> that'll tell me the REAL part numbers _and_ voltage
* groOgle suspects that ket is saying "the setting is correct".
<groOgle> OK, how do I dump it?
<ket> when memtest starts it shows spd contents
<ket> e.g.  SPEED: PC3200 (DDR400B)  VOLTAGE: 2.6V
<ket> etc
<groOgle> ket: OK, I'll check when I restart.  There's nothing showing
           on the screen right now, but I want it to complete the
           tests.
* groOgle notes approximately one ECC error per second.

Tuesday, 5 April 2005

<ket> HCoyote - how's your memory-fu?
<HCoyote> I can remeber that I've got 10 fingers and 10 toes ...
<HCoyote> and that today is monday
<ket> HCoyote - 36 chip 2GB, bank arrangement 4x9..  that's not valid is it?
<HCoyote> could be chipkill
<ket> it's not, checked the pns.
* grOogle notes that HCoyote fails the memory test.
<HCoyote> dunno then ...  haven't had my hands on 2GB parts for a few months.  I try to keep my hands out of the
+         computers now.
<ket> HCoyote - heh.
<grOogle> HCoyote: If they're my chips, they do chipkill.
* ket is still trying to figure out what hte fuck grOogle got his hands on there, because wintec doesn't list it in
+ their reseller listings either
<ket> grOogle - no they don't.  i can make out the pns on the supports and they don't support chipkill
<HCoyote> any part numbers?
<grOogle> HCoyote: Hold on.
<ket> HCoyote - the micron numbers are fudged, but hang
<ket> ics A16859AK
<ket> also a DN431090C
<grOogle> http://www.lemis.com/grog/Thumbnails-20050404.html
<ket> HCoyote - hang on i'll link you
<ket> the pair is non-pll fanout yes? i can never fucking remember..  :(
<ket> nonono..  dumbass me.  hang on.
<grOogle> ket: BTW, these chips work fine in the server for which they were purchased.
<ket> grOogle - it's probably overvolting the dimms
<HCoyote> what's the problem then?
<ket> grOogle - i have an abit kd7a that does it unintentionally, lets me run midgrade (C2) kingston parts at
+     incredibly tight timings
<ket> HCoyote - well for one that dimm seems to not exist.  :P
<grOogle> ket: I checked wiht memtest86+.  No info about the voltage or anything else.
<ket> HCoyote - okay, the 13:26 register is the 16859AK
<ket> grOogle - memtest86 is far from foolproof :(
<ket> 43109 is..  making me suffer through a very slow website :P
<grOogle> ket: You said that it would supply info about the voltage and other stuff.  It doesn't.
<ket> grOogle - it does SOMETIMES
<ket> grOogle - it does on my netfinity, bp6, and kd7a.  it SOMETIMES does on the kt7-raid.  if the bios interferes with
+     spd, you're pretty much SOL
<ket> HCoyote - oopsie, i botched the pn.  it's 95V857AG, 2.5 wide range freq clock driver; like i said - no chipkill
<ket> WOAH
<ket> http://www.icst.com/icscs/PartSummary.aspx?id=97e3ff13-ab42-459e-bf7c-d736439be249&name=ICS95V857AH&mode=full
<ket> note voltage :P
<ket> also number of outputs.
<HCoyote> so ..  again ..  what's your issue with the memory?
<HCoyote> it's a 2GB registered ECC dimm
<ket> HCoyote - well now i'm pretty much positive that it's not what it claims.  the MT P/Ns don't match private label
+     (which wintec never uses) and the parts just don't match
<ket> HCoyote - dude, micron doesn't make anything below a 256MB in that package for pc3200
<ket> i checked.
<ket> the 13:26 also doesn't go below 210MHz, which i can't imagine is conducive to 200MHz operation
<grOogle> below?
* HCoyote scratches head.
* HCoyote goes to find that ram testing consortium website thing.
<ket> grOogle - right; it's operating range is specifically 210-270MHz
<ket> HCoyote - cmtlabs.com?
<ket> HCoyote - i'm heading there again now, see if they've got fresh
<ket> wintec's been testing w/intel and hasn't certified or even tested a 2GB part, only a 1GB which is listed on AMPO
+     and reseller
<ket> ah, there it is..  35964741-L, but that's a 2GB PC2700 not PC3200
<grOogle> ket: When did they come up with a minimum operating frequency?
<grOogle> ket: Well, in that range, anyway.
<ket> grOogle - that's the minimum operating frequency of a specific _part_
<grOogle> ket: When did they come up with a minimum operating frequency?
<ket> grOogle - see the square ICS chip in the middle? _that_ part.  and that's just how the part works.
* grOogle gives up trying to get an answer to his question.
<ket> grOogle - i've just told you all i know about that particular IC.  it runs at a range of 210MHz to 270Mhz with a
+     voltage range of 2.3-2.7V and a nominal of 2.5V
<ket> interesting..  http://www.cmtlabs.com/images/Scans/WTC037.jpg
<ket> that's a pc2700 2gb from wintec..  which has the exact same parts as this supposed pc32700
<ket> er 3200
<ket> ...  including the pcb ID
<grOogle> Hmm.  That looks familiar.
<ket> grOogle - because it's the exact same dimm. 
<ket> it's got different memory chips, but it's the _exact_ _same_ _dimm_
<grOogle> You mean the layout?
<ket> no, i mean it's the exact same part, exactly.
<ket> the ONLY difference is micron chips instead of infineon.  other than that, it is identical down to the support
<grOogle> So they can handle the frequencies of both.
<HCoyote> same dimm doesn't mean anything, really.
<HCoyote> We had a few sets of ATP memory at work ...  same size ...  same speed ...  same part number ...  one was low
+         profile, the other wasn't.
<ket> grOogle - not necessarily
<ket> HCoyote - btdt (and atp is good stuff of course), but this is different..  there's been a lot of forging going on
+     past six months, encouraged by geils prior forgery
<ket> HCoyote - a LOT of people selling nonexistant parts, overclocking by reprogramming the eeprom or just
+     mislabelling
<HCoyote> um.  what wintec part at cmtlabs were you looking at?
<ket> HCoyote - the only 2GB module they've tested, 35964741-L
<ket> http://www.cmtlabs.com/memSearchResults.asp?sManuf=Wintec+Industries&sPN=35964741-L&oSubmit=Search
<HCoyote> that's not the same part as what's pictured in his photo.
<ket> HCoyote - no, it's not by the partnumbers.
<ket> HCoyote - but stickers are easy to forge; look at the electrical component pns.  they're absolutely identical.
+     same for support layout.
<ket> HCoyote - pcb numbers _also_ match digit for digit
<ket> Register A16859AK PLL 95V857AG
* ket watches HCoyote compare and go 'oh..  it _is_.'
<ket> grOogle - is accutek your vendor?
<grOogle> ket: I borrowed this RAM.
<ket> sorry, should've said 'a local vendor', and yes they are.
<ket> with a really shitty website
<ket> www.accutek.com.au - so that answers that question
<HCoyote> grOogle:  if you have any doubts, just talk to wintec.  It's not unheard of for RAM vendors to parts out
+         that aren't listed on their website yet.
<ket> HCoyote - agreed, but i didn't even see it on resale list.
<ket> THAT'S what bugs me
<HCoyote> ket:  Tyan has that part listed as a valid part for several of their Intel motherboards.
<ket> it's possible the resale list was out of date, but with all the forgeries and grey market bullshit.. 
<ket> HCoyote - which part? the pc2700?
<HCoyote> no
<HCoyote> http://www.tyan.com/support/html/memory_s2885.html
<HCoyote> go down to the DDR400 section.
<ket> uhm, s2885 ! intel
<HCoyote> err, sorry
* ket smacks
<HCoyote> http://www.tyan.com/support/html/memory_s5360.html
<ket> well i'll be damned
* HCoyote kicks ket
<HCoyote> STFU.
<ket> ahahah pdp patriot.  hahahaha
<HCoyote> so, anyway ...
<HCoyote> grOogle:  if you have any doubts about the authenticity, just ask Wintec.  I'm sure they'd be able to point
+         you in the right direction.
<ket> hahaha..  ocz..  ocz's micron 512mb passes but their 1gb failed, almost assuredly.  pdp is funny though, since
+     they're not pure samsung..  gamble
<ket> HCoyote - images are high enough to do quickie compare on the sticker, it doesn't look forged
<ket> msi just can't push the load, no surprise
<HCoyote> so why did we go on this wild goose chase?
<ket> HCoyote - because i fucking hate forgeries. 
<ket> ran into a shithole stickerfactory that was labelling flaky pc66 as pc100 once
<HCoyote> it's likely an OEM'd part.
<ket> considering wintec doesn't do any retail parts? :P
<ket> it's a mass-production resale part
<HCoyote> funny, looks like you can buy it online through wintec's website.
<HCoyote> (wintec ram, in general)
<ket> HCoyote - yah, in antistatic bags or if you buy enough, in a packbox. 
<ket> HCoyote - e.g.  not retail box
<ket> you don't get instructions, fancy box, how tos, etc

Wednesday, 6 April 2005

<HCoyote> wow ...  1ecc a second is bad.
<grOogle> HCoyote: Right.
<grOogle> HCoyote: Do you agree with my conclusions?
<ket> grOogle - that the board sucks? of course :D
<HCoyote> That's excessive.
<HCoyote> way excessive.
<ket> HCoyote - aka 'average for msi'
<ket> HCoyote - remember, the board doesn't even have the memory traces correct; 4 dimms to 1 cpu
<HCoyote> the author of bluesmoke was talking to us about testing that ECC was working and suggested we aim a blow
+         dryer at the RAM while stressing the system with memtest ...  sounds ilke you may not have adequate cooling
+         around the memory.
<ket> HCoyote - it's not that; it was voltage.
<HCoyote> (and yes, we tested that ...  it works)
<ket> HCoyote - known issue with the k8t-master2-far.  the fucking thing hasn't got anywhere _NEAR_ enough voltage reg.
<HCoyote> ah
<HCoyote> yeah ...  we don't touch MSI.  
<grOogle> HCoyote: Could well be.  The DIMMs get too hot to touch, a first in my experience.
<ket> HCoyote - yeah, i'm pretty much the most knowledgable about msi here.  been working with them since '94
<HCoyote> grOogle:  stick a fan over the DIMMS while you're testing and see if that helps.
<ket> HCoyote - to say they've gone downhill is the understatement of the millenium.
<grOogle> HCoyote: I suppose I should, but I've wasted too much time on this already.
<HCoyote> We've had to do that with some lab systems that have 56GB of ram and not adequate cooling.
<grOogle> HCoyote: The DIMMs were only loaners, and the memory I have in there now works fine.
<HCoyote> (we ended up sticking some delta black labels right on the dimms and noticed a drammatic improvement)
<HCoyote> <nod>
<HCoyote> We're only using DDR333 for our opterons.  <shrug>
<HCoyote> (except for the few Suns we have)
* HCoyote wants to get one of the HP 2p opteron desktops for home.
<ket> HCoyote - you should be using 2.5V PC3200
<ket> HCoyote - every opteron i've built, i've specced 2.5V.  it's dramatically cooler.
<HCoyote> 1) it's a waste of money on older processors 2) it's a waste of money if you have issues with boards that
+         don't support it correctly.
<HCoyote> We don't build systems.  We buy them.
<HCoyote> previously, we built 1 and sent it to the local vendor to duplicate.
<HCoyote> now, we just buy tier 1
<HCoyote> On that note, I'm going to go watch people beat each other with sticks.
<quiet1> ...  televised british parliment?


Greg's diary Greg's home page Greg's photos Greg's links

Valid XHTML 1.0!