A high-performance cluster (HPC) is an array of modern processors working in parallel to transform massive amounts of data. A cluster presents a new set of challenges by creating millions of new relationships between CPU/GPU processors, memory, network, storage, and layers of middleware and software.
That volume of data is too much for a human administrator to remember, understand, and act upon in real time!
AutonomousHPC can record, analyze, and understand over 1000 metrics per node, per second.What else can AutonomousHPC help you with?
AutonomousHPC helps you monitor and manage your Spectrum Scale HPC in the way autonomous cars can react to conditions faster than a human. You retain control and allow AutonomousHPC to take over when you want it to, like a car that can parallel park by itself.
The time needed to process the typical genomic assay jumped from 4 to 6 hours to 24 hours. These performance challenges were preventing the customer from meeting real-world service level agreements (SLAs) for cancer patients.
Algorithms in AutonomousHPC helped understand the sophisticated changes in the system and indicate a path to resolution. It determined that the cache was filling up and ultimately causing the system failure.
An education customer has a $500,000 budget to upgrade the system. If the customer doesn’t spend correctly, it could just be throwing dollars at the problem.
High-performance clusters can turn a computational bottleneck into a storage bottleneck. As the speed of the processor doubles, it may outpace the supporting infrastructure.
AutonomousHPC helped understand the performance bottlenecks to efficiently distribute the budgetary dollars and ensure maximum output.
In Houston, HPC managers’ paychecks are tied to datacenter efficiency. Every dollar counts toward the annual bonus. This includes hardware, software, administrators’ salaries, power, hosting, etc.
AutonomousHPC is used to ferret out any inefficiencies in the system to maximize the output while minimizing actual costs.
Spectrum Scale provides the data layer for key claims and policy operations. Any outage results in immediate challenges to customer-facing operations.
AutonomousHPC is on the lookout for configuration changes, suspicious trends, and alerts of critical issues.
Although supercomputers are typically built from the same commodity components, each cluster includes a unique blend of manufacturers, technologies, proportions, workloads, and applications. No one vendor can support the entire system, leaving the support to the incumbent engineering staff.
AutonomousHPC learns how the system functions and then detects subtle trends and substantial changes, ensuring that the human administrative team is operating with the best understanding of the system they own.
AutonomousHPC is built on familiar technology.
It is a plug-in for Elasticsearch for analytics and Grafana for graphic reporting elements.
It is a modular appliance that lives on-premises, so your data never leaves your firewall and it does not add processing overhead to your HPC.
Your administrators can manage more nodes and petabytes.
Just like the latest generation of autonomous cars, which can drive for you but allow you to operate with help, AutonomousHPC can either adjust for you or notify you of which changes are needed. Alerts are sent via email, SMS, and dashboard. You have the data necessary to manage your system the way you want to manage it.
Learn more about what autonomous technology can do for your data center. Schedule a briefing today.