GNU/Linux S5R3 App 4.49 available for Beta test

Gary Roberts

Moderator

Joined: 9 Feb 05

Posts: 5887

Credit: 119407509007

RAC: 25924885

RE: Switching Apps in the

20 May 2008 1:58:50 UTC

Message 81780 in response to message 81778

(moderation:

)

Quote:

Switching Apps in the middle of a task usually confuses the run-time estimation ...

I disagree :-).

My experience says that the run time estimate remains as before and if the new app is appreciably faster then the remaining time estimate will simply reduce faster than it otherwise would do. DCF is not affected by the app change until the task completes at which point the faster than expected finish (of this and subsequent tasks) will cause the DCF to reduce until a new equilibrium is reached. In other words, the app change in the middle should have no immediate effect on DCF.

Two things that will have an immediate effect on DCF are a faulty benchmark test and changing the clock while BOINC is running. There are probably more.

If you just want to fix things then stop BOINC and adjust the DCF in the state file to be 0.922172 x a / b, where a is the true time the task will take and b is the faulty estimate. From Kathryn's figures, a and b should be about 9 and 40 respectively. So change the DCF to be 0.210000 and all should be well. Then restart BOINC.

Cheers,
Gary.

Paper Moon

Joined: 12 Apr 08

Posts: 14

Credit: 292052

RAC: 0

Danke schÃ¶n, Bernd. Die

20 May 2008 3:28:51 UTC

Message 81781

(moderation:

)

Danke schÃ¶n, Bernd. Die S5R3_449.SSE2, obbwohl versehentlich ins Lebenslicht gerÃ¼ckt, ist ein richtiger Volltreffer. Das GlÃ¼ck gehÃ¶rt eben den TÃ¼chtigen. GroÃŸe Klasse!

MfG,
Waldi

Gary Roberts

Moderator

Joined: 9 Feb 05

Posts: 5887

Credit: 119407509007

RAC: 25924885

RE: RE: I think I stand

20 May 2008 4:28:38 UTC

Message 81782 in response to message 81774

(moderation:

)

Quote:

Quote:
I think I stand corrected ....

I'm not sure about that ....

Bikeman was quite correct that the SSE2 version of the app is significantly faster than the SSE version on core architecture. I've now seen results for a Q6600 which show a modest speedup only if using the SSE app but a much better speedup if using the SSE2 app. So, if you are using the beta app on a C2D or C2Q you should be using the SSE2 version if you want the best performance. It would be interesting to know why the SSE version performs relatively poorly on just this architecture.

@Bernd - is there any prospect of a Windows version any time soon? Whilst I had previously converted about half my fleet to Linux some time ago, I don't really want to have to convert the balance if Windows users will get the benefits in the near future.

Cheers,
Gary.

Klimax

Joined: 27 Apr 07

Posts: 87

Credit: 1370205

RAC: 0

I see heavy optimization

20 May 2008 11:36:32 UTC

Message 81783

(moderation:

)

I see heavy optimization going on.
I have not time to see how much of memory copy is in app,but would not be good to SSE(2) it as well?I think about implementation like in VLC media player...(code is however in GPL,so only inspiration most probably :-|)

And how long it will take to see current optimization in Win appp?

Donald A. Tevault

Joined: 17 Feb 06

Posts: 439

Credit: 73516529

RAC: 0

RE: RE: RE: I think I

20 May 2008 13:05:42 UTC

Message 81784 in response to message 81782

(moderation:

)

Quote:

Quote:
Quote:
I think I stand corrected ....

I'm not sure about that ....

Bikeman was quite correct that the SSE2 version of the app is significantly faster than the SSE version on core architecture. I've now seen results for a Q6600 which show a modest speedup only if using the SSE app but a much better speedup if using the SSE2 app. So, if you are using the beta app on a C2D or C2Q you should be using the SSE2 version if you want the best performance. It would be interesting to know why the SSE version performs relatively poorly on just this architecture.

Since I've switched my X9650 Extreme Quad-core machine over to the SSE2 app, I've been getting crunch times as short as just under three hours. It's also done some good on my AMD 6000+ machine, though probably to not as great an extent.

If anyone would like to see for themselves. . .

For my 6000+ and X9650 machines, workunits that have been completed with the regular SSE app are branded with app version 4.49. Any that have been completed with the SSE2 app are branded with app version 44.91. (It was supposed to be "4.491", but the boinc client apparently doesn't know what to do with three decimal places.)

M. Schmitt

Joined: 27 Jun 05

Posts: 478

Credit: 15872262

RAC: 0

Well the speedup on your X2

20 May 2008 13:38:15 UTC

Message 81785

(moderation:

)

Well the speedup on your X2 6000 is even better than on your Xeon. :-)

Lowest boost on your Xeon is less than 10 % while the X2 does not fall below 16.x %. Max. boost is about the same, more than 20% faster.

My X2 5000 even lies between 19 and 22%, but hasn't gotten short units so far.

cu,
Michael

Donald A. Tevault

Joined: 17 Feb 06

Posts: 439

Credit: 73516529

RAC: 0

RE: Well the speedup on

20 May 2008 14:08:35 UTC

Message 81786 in response to message 81785

(moderation:

)

Quote:

Well the speedup on your X2 6000 is even better than on your Xeon. :-)

Lowest boost on your Xeon is less than 10 % while the X2 does not fall below 16.x %. Max. boost is about the same, more than 20% faster.

My X2 5000 even lies between 19 and 22%, but hasn't gotten short units so far.

cu,
Michael

I'm running with Hyperthreading enabled on the Xeon. That may be skewing the results a bit.

Rakarin

Joined: 10 Dec 05

Posts: 9

Credit: 163914

RAC: 0

I've had a successful run.

21 May 2008 23:19:16 UTC

Message 81787

(moderation:

)

I've had a successful run.
I'm running Fedora Core 8 x64, BOINC 6.2.1 / 6.2.2 (recently updated), AMD Athlon x64 4200 (on a single core).

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4349

Credit: 253307522

RAC: 38803

RE: I see heavy

22 May 2008 6:57:03 UTC

Message 81788 in response to message 81783

(moderation:

)

Quote:

I see heavy optimization going on.
I have not time to see how much of memory copy is in app,but would not be good to SSE(2) it as well?I think about implementation like in VLC media player...(code is however in GPL,so only inspiration most probably :-|)

I don't think that there's much copying done in the App, apart from the first ~1min after startup when setting up the memory structures, reading the data files etc.

Quote:

And how long it will take to see current optimization in Win appp?

The prefetching is not new at all, it's already used in the MacOS and Windows (Beta & Power) Apps.

As for the advantage of the SSE2 over the SSE version I first need to find out where it comes from, further measurements will depend on this.

With some recent developments around BOINC I'll have another try on compiling the "science App" for Windows with gcc, too, so we can focus on a single compiler (yes, I'm aware of icc, but using that is even more difficult with our current code).

Klimax

Joined: 27 Apr 07

Posts: 87

Credit: 1370205

RAC: 0

RE: RE: I see heavy

24 May 2008 6:28:51 UTC

Message 81789 in response to message 81788

(moderation:

)

Quote:

Quote:
I see heavy optimization going on.
I have not time to see how much of memory copy is in app,but would not be good to SSE(2) it as well?I think about implementation like in VLC media player...(code is however in GPL,so only inspiration most probably :-|)

I don't think that there's much copying done in the App, apart from the first ~1min after startup when setting up the memory structures, reading the data files etc.
Quote:
And how long it will take to see current optimization in Win appp?

The prefetching is not new at all, it's already used in the MacOS and Windows (Beta & Power) Apps.

As for the advantage of the SSE2 over the SSE version I first need to find out where it comes from, further measurements will depend on this.

With some recent developments around BOINC I'll have another try on compiling the "science App" for Windows with gcc, too, so we can focus on a single compiler (yes, I'm aware of icc, but using that is even more difficult with our current code).

BM

Thanks for answers.

My question was not about prefetching and alike(win beta is being used)... :-) but about SSE2.How long those measurements will take?

And it looks like I forgot that memory-intensive does not have to mean copy op...

GNU/Linux S5R3 App 4.49 available for Beta test

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner