Next: About this document ...
Contents
- Introduction
- Critical factors to Grid performance
- Aggregated metrics
- Drawing the Goodness Frontier
- DISTECM assessment
- DISTECM deployment & confirmation
- Future Work
- Conclusions
Introduction
The context:
- Load distribution is often needed in HPC
- Bad decisions cost time and resources
- Grid computing introduced new variables in deployment decision
This talk introduce:
- Auxiliary indicators for measuring the potential applicability of a parallel application to a Grid
- Used by our Group to access the applicability of an ECM implementation to a Grid
2Scenarios
Not all parallel implementations are adequate to all Grids.
We may have:
We were in the first case.
Critical factors
- Minimum bandwidth required
- What is the minimum bandwidth between nodes that application requires
- Maximum latency allowed
- Data transmission frequency
- How often there is a need to transmit data between nodes;
- Amount of data to transmit
Aggregated metrics
Frequency of Interruptions for Communication (FIC)
Total Time of Data Transmission (TTDT)
Aggregated metrics II
Maximum Communication Overhead (MCO)
Maximum Latency Overhead (MLO)
Goodness frontier
- A - maximum limit to TTDT above which it does not pay to
use the grid, even for small values of FIC. To define its value we
can take in account the latency through the MLO ratio;
- M - slope of the goodness straight line
The slope can be found through:
Goodness application ratio
[width=7cm]imagens/racio.eps
ECM over the grid (DISTECM)
An Elliptic Curve over a field K is the set of solutions that respect:
 |
(1) |
DISTECM:
- Core ECM functions provided by Paul Zimmermann ECM code
- Distributed version based:
- - Master / slave distribution paradigm
- - Globus toolkit 2
- - BSD sockets (and not PVM / MPICH)
DISTECM Assessment (grid 1 - Low-end grid)
- Composed of old desktops (Pentium III 750 Mhz)
- Slow connections (ISDN 64 Kbps)
[width=7cm]imagens/raciob.eps
DISTECM Assessment (grid 2 - Beowulf Cluster)
- Cluster of PCs built with commodity components and off-the-shelve hardware solutions
- Connected with a 100Mbps FastEthernet
[width=7cm]imagens/racioc.eps
DISTECM deployment
Hardware specification:
- Cluster of Dual-Xeons
- Fast-Ethernet 100 Mbps
Challenge specification:
The number P(10341) has 109 digits and was easily factored into
DISTECM tests
[width=7cm]imagens/distecm.eps
DISTECM results
Factored C98 as:
Using 42 hours of the SMP Dual-Xeon cluster (with 8 processors).
Future work
- Enhancements of these metrics in the scope of the Global grid Forum (GGF)
- Developing distributed factorization platforms: NFS
- With CWI (which was involved with RSA 512 factorization)
- Already factored
[b](8.6,-3.0)[width=5cm]imagens/nfsarq.eps
Conclusions
Finally:
- Depicted metrics can be of help in the process of defining a formal approach to understand what applications are appropriate for use in grids
- Should not be used in a deterministic way as a ``yes or no'' oracle suggestion
- Useful to perform grid / application specification analysis and avoid that requirements definition is based only in ``common sense experience''
- Distributed ECM (DISTECM) platform has proved to be a good approach
for factoring numbers under 100 digits
Thank you. Questions ??
Paulo Trezentos
Paulo.Trezentos@iscte.pt
http://paulo.trezentos.gul.pt/articles/
Next: About this document ...
Paulo Treentos
2004-01-12