Nor-Tech’s NT-EZ HPC Cluster Support Solutions
A complete portfolio of HPC cluster support solutions for effortless deployment, operation, and maintenance
Nor-Tech has been a primary innovator in the HPC cluster space for more than a decade. Our elite staff of expert engineers, averaging more than a decade of hands-on experience, is consistently conquering industry challenges that our competitors long abandoned as insurmountable. The result is a suite of cost-effective NT-EZ branded services that remove the classic obstacles to high performance computing.
- NT-EZ RepliSafe: This is a new, easy-to-use bare metal recovery solution designed specifically for software-mirrored SAS/SATA/NVMe drive pairs. It enables full system restoration directly from mirrored SAS/SATA/NVMe drives without needing additional configuration.
- NT-EZ Remote Visualization: Storage access too slow? With NT-EZ Remote Visualization, big data files don’t have to be transferred to the user’s location to be viewed for post processing. Only keyboard, mouse and screen bits travel to the user location.
- NT-EZ Storage Guard: -StorageGuard continuously monitors filesystem capacity and storage health across critical HPC infrastructure. It is designed to detect and alert on storage conditions before they become outages or job-impacting failures.
- NT-EZ SATM (System Ambient Temperature Monitor): NT-SATM monitors environmental and ambient temperatures across HPC systems and infrastructure to help identify cooling problems before they impact system stability or hardware reliability.
- NT-EZ RCR Rapid Cluster Recovery/Bare Metal Backup: Run manual cluster backups or schedule backups seamlessly. This not only backs up the cluster data, but also backs up an image of both the data and the OS. In the event of a disaster the whole cluster software and applications environment can easily be recovered. This adds a level of reliability to our clusters that many competitors don’t provide.
- NT-EZ Remote Monitoring & Management: This provides an easy way for our clients to use and manage complex equipment such as a High Performance Compute Cluster without HPC expertise in-house. This service is provided on either a scheduled basis or an as needed basis. The scheduled service allows us to find and fix small problems before they become disasters. This service is sold in blocks of hours ahead of when the service may be needed.
- NT-EZ AI: Expertly integrated to maximize AI capabilities and deliver the fastest results. The solution includes CUDA, CUDA Toolkit, PyTorch, TensorFlow, CUDNN, NVIDIA Drivers, Ubuntu and more.
- NT-EZ Grafana: An open-source analytics and interactive visualization web application for monitoring application performance. It allows users to ingest data from a wide range of sources, query and display it in customizable charts, set alerts for abnormal behavior, and visualize data on dashboards. Watch the video.
- NT-EZ NodeStat (Cluster visibility at a glance): NT-EZ NodeStat provides a real-time operational view of HPC cluster health, node availability, scheduler state, storage usage, and hardware status from a single interface. Designed for Linux-based HPC environments running Slurm or PBS, it gives administrators and users immediate insight into the state of the cluster without digging through multiple commands or dashboards.
- NT-EZ ImageManager (Simplified Cluster Image Deployment and Management): ImageManager streamlines the creation, organization, deployment, and maintenance of operating system images used in HPC and provisioning environments. Built for Warewulf/OpenHPC workflows, it simplifies the process of managing golden images across compute infrastructure.
- NT-EZ PowerControl (Centralized node power management for HPC clusters): NT-EZPowerControl provides administrators with fast, centralized control of cluster node power operations through integrated BMC/IPMI management. It simplifies power orchestration for maintenance, recovery, provisioning, and cluster startup operations.
- NT-EZ FanControl (Intelligent fan management for HPC and enterprise systems): NT-EZ FanControl provides centralized control and monitoring of system fan modes across cluster infrastructure using BMC/IPMI interfaces. It allows administrators to quickly query, adjust, and standardize cooling profiles across large numbers of systems from a single management point.
We are continually evaluating additional areas where we can innovate in order to bring more value to our clients.
Request Info
SEND US A MESSAGE
"*" indicates required fields