by Julio Jerez » Thu Mar 22, 2018 2:38 pm
Hey Joe, I now have the parallel solver working.
If you sync, the test you will see will not mean much, and all the other demos may malfunction. This is set up to force a call to the parallel solver with that simple stack and check that it is stable enough.
Bodies are not going to sleep, and to me this is sufficient to warrant continuing to develop it.
The big difference from the previous solver is that this one does not require a pre-pass over the joint array to find the sets that can be solved in parallel in different threads.
The convergence rate is definitely lower than that of the sequential solver; however, we already took care of that by assigning important contacts to be solved exactly with the exact solver.
For example, a car or a tank will take the contacts with it and solve them exactly,
while what the parallel solver will do is deal with massive joint sets in single or multiple islands, and we can make up for the convergence by adding more passes if we need to.
The solver is a stripped-down version of the old solver. It does not deal with CCD or sleep or anything like that the way the sequential solver does; it is just a brute-force, many-thread, joint-crunching solver, so it will always go to the maximum number of passes.
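To make the trade-off concrete, here is a rough sketch, not the actual Newton code. It assumes a Jacobi-style update stands in for the parallel solver and a Gauss-Seidel update stands in for the sequential one; the post does not name either scheme. The 20-row chain system and all function names are made up for illustration.

```python
def jacobi_pass(x, b):
    # Assumption: stand-in for the parallel solver. Every row reads only
    # values from the previous pass, so rows can be handled in any order
    # (or on different threads) without first finding independent sets.
    n = len(x)
    return [
        (b[i]
         + (x[i - 1] if i > 0 else 0.0)
         + (x[i + 1] if i < n - 1 else 0.0)) / 4.0
        for i in range(n)
    ]


def gauss_seidel_pass(x, b):
    # Assumption: stand-in for the sequential solver. Each row sees the
    # rows already updated in this pass, so the order matters.
    x = list(x)
    n = len(x)
    for i in range(n):
        left = x[i - 1] if i > 0 else 0.0
        right = x[i + 1] if i < n - 1 else 0.0
        x[i] = (b[i] + left + right) / 4.0
    return x


def residual(x, b):
    # Largest row error of the chain system 4*x[i] - x[i-1] - x[i+1] = b[i].
    n = len(x)
    return max(
        abs(b[i] - (4.0 * x[i]
                    - (x[i - 1] if i > 0 else 0.0)
                    - (x[i + 1] if i < n - 1 else 0.0)))
        for i in range(n)
    )


def solve(pass_fn, b, passes):
    # Fixed pass count, no early exit, like a solver that always runs
    # to the maximum number of passes.
    x = [0.0] * len(b)
    for _ in range(passes):
        x = pass_fn(x, b)
    return x


if __name__ == "__main__":
    b = [1.0] * 20
    for passes in (10, 40):
        print(passes,
              residual(solve(jacobi_pass, b, passes), b),
              residual(solve(gauss_seidel_pass, b, passes), b))
```

With the same number of passes the Jacobi-style version leaves a larger residual, and giving it more passes closes the gap, which is the "add more passes if we need to" idea.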
My analogy is the express subway in Manhattan: they have this train that only stops at some specific stations, and they have local trains that stop at every station.
If you go from Long Island to the Bronx, it takes you four or five hours if you take a local train. If you take a local to the nearest stop, then an express to the stop closest to the destination, and another local to your final stop, it takes probably 45 minutes.
The parallel solver will be like that: a preprocess phase will combine some joints and make a single large island, it will filter out CCD, small islands at rest, and stuff like that, then I will solve all the small islands by calling the sequential solver, and the one large island with the parallel solver.
This is not done yet; it is still working island by island, but that is easy to do this weekend. For now I will test more and make sure it is stable and robust.
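The planned island split could look roughly like the sketch below. This is only an illustration of the routing idea, not code from the engine: the `Island` fields, the `partition_islands` name, and the size threshold of 32 joints are all assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass
class Island:
    # Hypothetical record; the real engine's island data is not shown in the post.
    joint_count: int
    at_rest: bool = False
    uses_ccd: bool = False


def partition_islands(islands, parallel_threshold=32):
    """Route islands to the sequential path or to one merged parallel batch.

    Assumption: anything that needs CCD or sleep handling, plus any small
    island, stays on the sequential solver; everything else is merged into
    a single large batch for the parallel solver.
    """
    sequential = []
    parallel_batch = []
    for island in islands:
        small = island.joint_count < parallel_threshold
        if island.uses_ccd or island.at_rest or small:
            sequential.append(island)
        else:
            parallel_batch.append(island)
    return sequential, parallel_batch


if __name__ == "__main__":
    islands = [
        Island(4),
        Island(500),
        Island(300, uses_ccd=True),
        Island(200),
        Island(100, at_rest=True),
    ]
    seq, batch = partition_islands(islands)
    print(len(seq), "islands for the sequential solver")
    print(sum(i.joint_count for i in batch), "joints in the parallel batch")
```

The threshold would need tuning per machine, since the point where the parallel path pays off depends on thread count and per-pass overhead.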
Here is the good thing, Joe: this solver can work in high-performance computing. Take for example my machine at home: it is an i7 7700, and it comes with an embedded GPU, so if I go to the BIOS, I do not need a discrete GPU.
It seems that if we try OpenCL again, we can make this system use these embedded GPUs and APUs as math coprocessors.
The GPU in my CPU is the HD Graphics 630. The reviews aren't clear, but some say it has 24 compute units and others say 48. Whatever that number is, it is a hell of a lot more cores than the 8, 10, or 16 cores in any CPU. Even if it runs at half the speed, the mass calculation makes it worth trying.
These are DX12-class GPUs, so they must be fast.
I figured that if the OpenCL driver lets you use the embedded GPU as a coprocessor, almost any system out there has one of these, and the driver probably takes advantage of the memory configuration, so maybe there is none of the kind of stuff people have to do when using discrete graphics.
I know for a fact that on consoles this PCI bus limitation does not exist, because the memory is unified.
Anyway, if you sync you will see a stack of 20 boxes standing, solved using the parallel solver.