Optimized RakeSearch app for rank 9 (computations finished)

Message boards : Number crunching : Optimized RakeSearch app for rank 9 (computations finished)
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

AuthorMessage
Profile Conan
Avatar

Send message
Joined: 8 Sep 17
Posts: 44
Credit: 10,378,024
RAC: 3,271
Message 730 - Posted: 5 Feb 2019, 1:47:08 UTC - in response to Message 729.  

Thanks Daniel for your efforts in getting this out there.

However I seem to have a problem with your new programme, all I get is

<core_client_version>7.4.25</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
SIGILL: illegal instruction
Stack trace (10 frames):
[0x411740]
[0x4f0b10]
[0x40c282]
[0x409d6c]
[0x40cdc4]
[0x4063dd]
[0x404f5c]
[0x4f21a4]
[0x4f2421]
[0x4058b6]

Exiting...

</stderr_txt>

Which is trashing a heap of work units, I re-downloaded and re-installed the files but this made no difference.
I noticed that the line <platform>xxxx</platform> had been added to the apt_info.xml file so I removed that to see if it made a difference but alas it did not.
So I will re-install your older version and see if all starts working again.

Thanks
Conan


Which app was used? The 955 doesn't have AVX so the SSE app would be required.

Same exe name so these should be a drop in replacement.

Thanks for the great work Daniel.


SSE is the one I downloaded (twice, in case I had a corrupted file).
I have to wait for other work to clear before I can see if the old one is still working, then I might try the new one again.
My other 2 computers have not cleared their work cache yet so have not downloaded new work, so I don't know if they work with the new optimized application.

Conan
ID: 730 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 8 Sep 17
Posts: 44
Credit: 10,378,024
RAC: 3,271
Message 731 - Posted: 5 Feb 2019, 6:29:16 UTC

It is now getting worse. All my computers have killed every work unit they get (Windows XP and Linux Fedora).

All was going great but this new version has just killed everything.

I have now uninstalled and re-installed on one computer, then downloaded the new version again, stopped Boinc and extracted files, then restarted Boinc and the 1 WU that was on the computer after reinstall then crapped itself and died.

So seeing now when new work downloads if the new version works from a bare slate.

Conan
ID: 731 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 8 Sep 17
Posts: 99
Credit: 402,603,726
RAC: 0
Message 732 - Posted: 5 Feb 2019, 10:01:11 UTC

I have checked this and found bug in compilation options. SSE2 app versions uses SSSE3 instructions which are not supported by your CPU, so they crash with error/signal "Illegal Instruction". I will release fixed app versions later today. Until then please use previous app version, or non-SSE one.
ID: 732 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 8 Sep 17
Posts: 44
Credit: 10,378,024
RAC: 3,271
Message 733 - Posted: 5 Feb 2019, 13:43:53 UTC - in response to Message 732.  

I have checked this and found bug in compilation options. SSE2 app versions uses SSSE3 instructions which are not supported by your CPU, so they crash with error/signal "Illegal Instruction". I will release fixed app versions later today. Until then please use previous app version, or non-SSE one.


Thanks Daniel,

I have put all back to the old application and am now again processing some work.

Conan
ID: 733 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
M0CZY
Avatar

Send message
Joined: 24 Aug 18
Posts: 6
Credit: 104,687
RAC: 26
Message 737 - Posted: 6 Feb 2019, 10:07:08 UTC - in response to Message 733.  

Neither the 32-bit Linux SSE2 app or the 64-bit Windows SSE2 app work for me (Error code 193) on my computers which don't support SSE3, but the plain 32-bit Linux app does work fine.

The 64-bit Windows SSE2 app only works for me on a computer that has SSE3.
ID: 737 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
jozef j
Avatar

Send message
Joined: 11 Sep 17
Posts: 51
Credit: 193,013,720
RAC: 1,965
Message 741 - Posted: 7 Feb 2019, 10:42:17 UTC

Fantastic work daniel and Natalia all Rake search team.

First in boinc history: i have 500K day RAC on single cpu with easy cooling cheap aio .. all intel expensive servers cpus are deep behind .. r/ayymd
on linux is possible get 10%+ more in totall , but my rac on windows is still not show full optimisations from daniel.. must wait week,two..
ID: 741 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
joe carnivore

Send message
Joined: 31 Oct 17
Posts: 3
Credit: 16,456,450
RAC: 1,833
Message 744 - Posted: 9 Feb 2019, 7:36:01 UTC

on my AMD x6 1075 and Athlon x2 BE-2400 doesn't work the SSE2 app.
I7 920 and Celeron j3455 work it fine.

It`s dont work with AMD.
ID: 744 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
hoarfrost
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Help desk expert

Send message
Joined: 11 Aug 17
Posts: 624
Credit: 20,555,850
RAC: 7,981
Message 749 - Posted: 9 Feb 2019, 9:00:29 UTC - in response to Message 741.  

Fantastic work daniel and Natalia all Rake search team.

First in boinc history: i have 500K day RAC on single cpu with easy cooling cheap aio .. all intel expensive servers cpus are deep behind .. r/ayymd
on linux is possible get 10%+ more in totall , but my rac on windows is still not show full optimisations from daniel.. must wait week,two..

josef j, you built a great computer! AMD Ryzen Threadripper 2990WX - is most powerful x86 CPU now. As minimun - until EPYC II Rome.
ID: 749 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 8 Sep 17
Posts: 99
Credit: 402,603,726
RAC: 0
Message 761 - Posted: 11 Feb 2019, 19:17:33 UTC

I have uploaded fixed SSE2 version, and SSSE3 version (notice triple S here, it is Supplemental SSE3). It turned out that with new compilation options these versions are a bit faster than previous "SSE2" version:

SSE2:
real    4m8.098s
user    4m6.121s
sys     0m0.032s

SSSE3:
real    4m2.163s
user    4m0.198s
sys     0m0.018s

Previous "SSE2":
real    4m14.850s
user    4m12.858s
sys     0m0.047s

ID: 761 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Theadalus

Send message
Joined: 20 Jun 18
Posts: 2
Credit: 183,750,083
RAC: 0
Message 763 - Posted: 12 Feb 2019, 20:16:34 UTC

@ Daniel

Will there be also a "AVX2 NOPEXT" v1.1 for AMD? :thumbsup:
ID: 763 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 8 Sep 17
Posts: 99
Credit: 402,603,726
RAC: 0
Message 764 - Posted: 12 Feb 2019, 22:19:08 UTC - in response to Message 763.  

@ Daniel

Will there be also a "AVX2 NOPEXT" v1.1 for AMD? :thumbsup:

No. New optimized app does not use PEXT instruction, so no need to build separate app version. Please use AVX2 version.
ID: 764 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 8 Sep 17
Posts: 44
Credit: 10,378,024
RAC: 3,271
Message 765 - Posted: 14 Feb 2019, 9:08:58 UTC - in response to Message 761.  

I have uploaded fixed SSE2 version, and SSSE3 version (notice triple S here, it is Supplemental SSE3). It turned out that with new compilation options these versions are a bit faster than previous "SSE2" version:

SSE2:
real    4m8.098s
user    4m6.121s
sys     0m0.032s

SSSE3:
real    4m2.163s
user    4m0.198s
sys     0m0.018s

Previous "SSE2":
real    4m14.850s
user    4m12.858s
sys     0m0.047s


Thanks Daniel, the new SSE2 optimizations work a treat and are in fact as you said, a bit faster than before, by quite a few minutes.

Thanks again for the effort and great work.
Conan
ID: 765 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Astronomy.Karelia

Send message
Joined: 7 Sep 17
Posts: 5
Credit: 29,201,364
RAC: 25,075
Message 770 - Posted: 16 Feb 2019, 10:54:55 UTC

Есть компьютер с 3 гигабайтами памяти, процессором Intel Pentium Dual Core E2220 и новым пустым жёстким диском.
Планируется поставить Windows 7 на него.
А какой лучше поставить?
32 или 64 разрядный?
32 или 64 разрядные оптимизированные под SSSE3 приложения RakeSearch будут считать быстрее?
Проводились ли замеры скорости счёта на одном и том же компьютере но под 32 и под 64 разрядный Windows?

I have a computer with 3 gigabytes of memory, Intel Pentium Dual Core E2220 processor and a new empty hard drive.
I planned to install Windows 7 on this computer.
What's version of Windows 7 is the best to install?
32 or 64 bit?
32 or 64 bit SSSE3-optimized RakeSearch applications will be count faster?
Whether measurements were carried out of speed of crunching on the same computer but under 32 and under 64 bit Windows?
ID: 770 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 8 Sep 17
Posts: 99
Credit: 402,603,726
RAC: 0
Message 773 - Posted: 17 Feb 2019, 15:19:35 UTC - in response to Message 770.  
Last modified: 17 Feb 2019, 15:29:20 UTC

Есть компьютер с 3 гигабайтами памяти, процессором Intel Pentium Dual Core E2220 и новым пустым жёстким диском.
Планируется поставить Windows 7 на него.
А какой лучше поставить?
32 или 64 разрядный?
32 или 64 разрядные оптимизированные под SSSE3 приложения RakeSearch будут считать быстрее?
Проводились ли замеры скорости счёта на одном и том же компьютере но под 32 и под 64 разрядный Windows?

I have a computer with 3 gigabytes of memory, Intel Pentium Dual Core E2220 processor and a new empty hard drive.
I planned to install Windows 7 on this computer.
What's version of Windows 7 is the best to install?
32 or 64 bit?
32 or 64 bit SSSE3-optimized RakeSearch applications will be count faster?
Whether measurements were carried out of speed of crunching on the same computer but under 32 and under 64 bit Windows?

Please use 64-bit OS and app, 64-bit software can use more registers and has SSE2 by default, so it usually is faster than 32-bit one.

All my previous benchmarks were done on 64-bit Linux.

Windows does not pin CPU-intensitive apps to one CPU core like Linux does, they are constantly floating between them. This adds extra overhead because of context switching, so Windows results are usually few percent worse than Linux ones.

I did some benchmarking to see how 32-bit apps perform on my Haswell Xeon. This was also done on 64bit Linux. CPU-intensitive apps like this one do not have to perform many syscals or use system libraries a lot, so results for 32-bit apps should be similar on 32 and 64 bit systems.

SSSE3 64-bit:
real    4m2.163s
user    4m0.198s
sys     0m0.018s

SSE2 64-bit:
real    4m8.098s
user    4m6.121s
sys     0m0.032s

SSSE3 32-bit:
real    4m37.972s
user    4m36.001s
sys     0m0.032s

SSE2 32-bit:
real    4m56.755s
user    4m54.779s
sys     0m0.040s

Non-SSE 32-bit:
real    4m55.787s
user    4m53.806s
sys     0m0.044s


As you can see, 32-bit apps are slower than 64-bit ones. Result for non-SSE app is a bit surprinsing, I suspected that limitations of 32-bit software combined with various CPU hardware optimizations are responsible for this. It would be interesting to see some benchmark results from old CPUs like your ones, unfortunately I do not have such machine.
ID: 773 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Astronomy.Karelia

Send message
Joined: 7 Sep 17
Posts: 5
Credit: 29,201,364
RAC: 25,075
Message 774 - Posted: 17 Feb 2019, 16:38:19 UTC

Thanks Daniel,

I will install 64 bit Windows.

And how to make such benchmarks?
ID: 774 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 8 Sep 17
Posts: 99
Credit: 402,603,726
RAC: 0
Message 776 - Posted: 17 Feb 2019, 19:58:05 UTC - in response to Message 774.  

Thanks Daniel,

I will install 64 bit Windows.

And how to make such benchmarks?

You can download sample data file and shell script used to start test from https://github.com/sirzooro/RakeSearch/tree/boinc/RakeDiagSearch/RakeDiagSearch/test. It can be run directly in Linux. On Windows you will need to install Cygwin. You can also try MinGW or MSYS, they also should work, but I did not try to use them - I prefer Cygwin.
ID: 776 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rymorea

Send message
Joined: 27 Apr 18
Posts: 8
Credit: 1,000,659
RAC: 6,330
Message 777 - Posted: 18 Feb 2019, 0:40:23 UTC

hi I installed avx version "rakesearch_windows_64_avx_v11.zip" and start to crunch but I am not sure this is the correct one for my old intel 2500K cpu.
It was take to finished 1425 sec. is it good time or do I try SSE - SSE2 version ?
When I look boinc event log its like this and show AVX support.
Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes syscall nx lm avx vmx tm2 pbe

Waiting your advice
ID: 777 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Astronomy.Karelia

Send message
Joined: 7 Sep 17
Posts: 5
Credit: 29,201,364
RAC: 25,075
Message 778 - Posted: 18 Feb 2019, 7:26:45 UTC - in response to Message 776.  


You can download sample data file and shell script used to start test from https://github.com/sirzooro/RakeSearch/tree/boinc/RakeDiagSearch/RakeDiagSearch/test. It can be run directly in Linux. On Windows you will need to install Cygwin. You can also try MinGW or MSYS, they also should work, but I did not try to use them - I prefer Cygwin.


Thanks, Daniel.

I will do tests in 2 - 3 weeks, when will the opportunity and time.
ID: 778 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Astronomy.Karelia

Send message
Joined: 7 Sep 17
Posts: 5
Credit: 29,201,364
RAC: 25,075
Message 780 - Posted: 19 Feb 2019, 13:18:02 UTC

Today I found time and opportunity.

I tested this benchmark on two identical computers with Pentium Dual-Core CPU E5400 and 4 GB RAM.

All settings of BIOS on both computers to do the same.

CPU Thermal Trottling - Disabled
Intel SpeedStep tech - Disabled

On both computers for the period of the test BOINC and Antivirus was stopped.

On first computer was installed 32 bit Windows 7 and 32 bit Cygwin Terminal.

On second computer was installed 64 bit Windows 7 and 64 bit Cygwin64 Terminal.

On 32 bit Windows was tested 32 bit rakesearch.exe from archive rakesearch_windows_32_ssse3_v11.zip

On 64 bit Windows was tested 64 bit rakesearch.exe from archive rakesearch_windows_64_ssse3_v11.zip

Results:
---------------------------

32 bit SSSE3 rakesearch.exe

real    5m59,453s
user    0m0,000s
sys     0m0,015s

---------------------------

64 bit SSSE3 rakesearch.exe

real    5m21,828s
user    0m0,015s
sys     0m0,000s
ID: 780 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Astronomy.Karelia

Send message
Joined: 7 Sep 17
Posts: 5
Credit: 29,201,364
RAC: 25,075
Message 782 - Posted: 21 Feb 2019, 6:18:06 UTC

Additional test. The same two computers.
32 bit SSE2 rakesearch.exe

real    6m15,250s
user    0m0,000s
sys     0m0,015s

---------------------------

64 bit SSE2 rakesearch.exe

real    5m28,938s
user    0m0,000s
sys     0m0,015s
ID: 782 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

Message boards : Number crunching : Optimized RakeSearch app for rank 9 (computations finished)

©2024 The searchers team, Karelian Research Center of the Russian Academy of Sciences