Ping a Specific Port

Question

knweiss

Asked: 2010-06-11 12:38:06 +0800 CST2010-06-11 12:38:06 +0800 CST 2010-06-11 12:38:06 +0800 CST

How to get the best LINPACK result and conquer the Top500?

772

Given a large Linux HPC cluster with hundreds/thousands of nodes. What are your best practices to get the best possible LINPACK benchmark (HPL) result to submit for the Top500 supercomputer list?

To give you an idea what kind of answers I would appreciate here are some sub-questions (with links):

How to you tune the parameters (N, NB, P, Q, memory-alignment, etc) for the HPL.dat file (without spending too much time trying each possible permutation - esp with large problem sizes N)?
Are there any Top500 submission rules to be aware of? What is allowed, what isn't?
Which MPI product, which version? Does it make a difference?
Any special host order in your MPI machine file?
Do you use CPU pinning?
How to you configure your interconnect? Which interconnect?
Which BLAS package do you use for which CPU model? (Intel MKL, AMD ACML, GotoBLAS2, etc.)
How do you prepare for the big run (on all nodes)? Start with small runs on a subset of nodes and then scale up? Is it really necessary to run LINPACK with a big run on all of the nodes (or is extrapolation allowed)?
How do you optimize for the latest Intel/AMD CPUs? Hyperthreading? NUMA?
Is it worth it to recompile the software stack or do you use precompiled binaries? Which settings? Which compiler optimizations, which compiler? (What about profile-based compilation?)
How to get the best result given only a limited amount of time to do the benchmark run? (You can block a huge cluster forever)
How do you prepare the individual nodes (stopping system daemons, freeing memory, etc)?
How do you deal with hardware faults (ruining a huge run)?
Are there any must-read documents or websites about this topic? E.g. I would love to hear about some background stories of some of the current Top500 systems and how they did their LINPACK benchmark.

I deliberately don't want to mention concrete hardware details or discuss hardware recommendations because I don't want to limit the answers. However, feel free to mention hints e.g. for specific CPU models.

1 Answers

Voted

superbeast · Answer 1 · 2010-06-22T14:37:13+08:00

superbeast

2010-06-22T14:37:13+08:002010-06-22T14:37:13+08:00

Give this tool a try it might help you, it suggests tuned values for some of the critical HPL parameters and there's a step by step howto for running HPL on clusters. The tool also estimates your rank in the TOP500 list depending on your system specs:

http://hpl-calculator.sourceforge.net

I hope you find it useful.

1

How to get the best LINPACK result and conquer the Top500?

Ping a Specific Port

How do I tell Git for Windows where to find my private RSA key?

How do you restart php-fpm?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Resolve host name from IP address

How can I sort du -h output by size

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?