It looks like I selected the wrong delivery type so my new 480s devils are not coming to Monday.
So started looking at memory errors.
I noticed on watttool info it said if you are overclocking memory check for memory errors using HWINFO
now not used this before so downloaded it and fired it up, I'm only interested in the sensor info so ticked the box sensor info only.
So I'm currently running my 31.25 mod across my 480s so how are they doing
well it seems a bit of a mix.
1 card is producing almost no memory errors at 2250/920
1 card is showing a very big number and climbing a lot.
the others are climbing but showing around 1mill errors
The number is a total so climbs per tick.
I found on some cards changing the settings and applying in watttool reset the number to 0 which was handy and other didn't which was odd and inconsistent.
now being new to this info I'm not actually sue what number is a good number, but I'm going with errors in memory can be corrected but poss not at the very high rate I was seeing on one card.
So seeing a few over a second say 30-50 was prob going to be ok but ones that are climbing at a rate of over this per sec are prob not doing the work you want it to do , despite the hash rate showing on claymore.
So a small adjustment of the mem speed 1 as low as 2190 now gives me lower or similar to 30-50 a sec errors. I found some wanted a lower speed and other just a tine amt more power.
my expectation here is that if a gpu is producing GH's and 1million errors per sec its unlikely to be doing the work you think it should be and while not logging errors at the app layer doesn't mean the hashes are correct...not really sure this is accurate but with out much info to go on it seems a reasonable assumption.
I'll keep an eye on pool side and see if a more inline with expected hash rate is seen after ironing out the memory errors seen and if the odd gpu hang which seems to happen with the 480s goes away or lessens.
Boysie
3 ·
Comments
I'm running a Asrock H81 Pro BTC mainboard, both molex connections to motherboard, 6x powered USB risers, 3 lots of sata connections from the power supply, with 2 cards on each connection sata set of power cables coming from the power supply;
Current Power supply is - CoolerMaster Vanguard 1200W 80+ Platinum Full Modul, +3.3/+5v are 25a, 125w output power, +12v 100a/1200w, -12v .5a/6w, +5Vsb 3a/15w
Next Power Supply will test, also a platinum; I feel this will give better stability.
+3.3/+5v 30a, 180w output, +12v 100a/1200w, -12v .8a/9.6w, +5Vsb 3.5a/17.5w
so the rom is no the point of the post, did you check your cards for memory errors?
1: 5 Memory errors at 2000Mhz
2: 496 Memory errors at 1950Mhz
3: 291,383,451 Memory errors at 2000Mhz
2 are msi and 1 is gigabyte
There is a reason why The Stilt is known for his work (source).
Once you have a significant amount of errors which get through, you'll get visible artifacts
and before that its just a loss of perf.
so thins explains why most people don't see a perf increase after 2200, the error rate prob increases massively and the gain is very little.
glad we are on a forum where people can openly discuss things they don't fully understand;
thanks for the link.
Boysie
It's kind of like HW error on btc asics, under 5% it does not affect the hashrate much.
I will check it today for my cards.
Not sure if losing a tiny bit of hash rate is worth shaving off the 140mill errors though.
I have since changed back to 1500 mem straps and there are 0 memory errors on any of the cards for my conservative 1050/870 core 1870/870 mem settings.
Will report back if this improves my poolside hash.
I would any-day take people sharing experimental settings they use over peeps with deep understanding who share nothing on the forum. You live you learn.
5 cards report 0 errors while one is picking up errors at a rate of 100k per 2-3 seconds. I've tried resetting the MEM to 1000 but the errors still build up.
As for reported vs. calc hashrate - they are pretty much identical. Occasionally the pool reports +/- 3MH
For now I'm back to my highest Hash settings ignoring the memory errors, since neither claymore or the pool report any difference. If cards begin failing ill switch back to stabler settings, but I dont really care if Im reducing card life from 5 years to 2. These are going to be resold in 10 months tops anyways.
I have decided to go back to the 1500 straps and they have been rock solid at ETH 26.8/ SIA 241 dual mining with the same clocks 1050/870 1870/870 and I am going to be sticking to these settings.
For a few cards you can perhaps check if the 1375 strap is stable for you and check in HWinfo for memory errors. If your cards pass, that memory strap will yield godlike performance/watt. But since I run about 30 cards, its hard to track and manage which cards run at what strap and settings etc so I just go with the minimum stable settings across the board.
And regarding the "effective average" hashrate at the pool... well that I don't know, but intend to verify. I'd appreciate If anyone would enlighten me on the matter, 'cause I'm having trillions of them...
Does anyone have 0% diff of average effective vs actual hashrate on ethermine?
My current thoughts are: hash is a hash is a hash. And "effective average" is network issues and pool/setup fuckups, not related to the card performance.
And I do have 10% lower the actual on ethermine.
EDIT: The number of errors on the image is for 20+hours of runtime
Your 100% memcontroller usage tells nothing about mem errors, because correcting mem errors also takes effort from the memcontroller.
So the question is what portion of the work produces valid shares.
If it's 90%, then the pool shows 90%.
10% of missing hashrate are seemingly mainly network issues...
workers also in a vlan and I've seen some weird behavior once the vpn server went down.
current dos is also to be considered.
Bottom line: I'm not yet convinced that mem errors is a factor of any sort. Unless someone will point out that with no errors on higher voltage (I run at .850 all) 470 consumes less...
I think I'm slightly below 150 on average. Total 9 cards(2*480+7*470 = [email protected] & [email protected]) on 2 platforms consume below 1400W off the wall.)
The pool 'lacking behind the actual' issue I plan to investigate, but now is not the time to run solo for my taste.
...also even in case I'm wrong I care none, since the cards out of the box give 22MH and consume more and I've got them all at 28.0-28.7 dual mining. EDIT: 480 does 29.1 and drives display.
@Boysie Totally support this kind of approach to be applied to the real world as well
No one is getting 100% of their reported hashrate as average 24h hashrate right? I get about 95%
What are you guys getting?
I'm expecting a silicon wall at 0.6