Skip to content

Conversation

sjkelly
Copy link

@sjkelly sjkelly commented Aug 20, 2019

This aims to use Hwloc.jl to compute block size based on the algorithm described in http://www.cs.utexas.edu/users/flame/pubs/flawn74.pdf.

Test don't pass yet since (I think?) the micro kernel needs to be parameterized by nr and mr.

Also based on my understanding of the paper some initial variables pertaining to SIMD size and latency need to still be set to determine the optimal block size. I am unsure at the moment if Hwloc will tell us this info.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant