Network Consulting Engineer – VXLAN & AI Data Center Networking
Posting date: | 14 August 2025 |
---|---|
Salary: | £45,000 to £47,000 per year |
Hours: | Full time |
Closing date: | 13 September 2025 |
Location: | Milton Keynes, Buckinghamshire |
Remote working: | On-site only |
Company: | GAK ENTERPRISES LTD |
Job type: | Permanent |
Job reference: | NCE_GAK_08_2025_05 |
Summary
We are seeking a highly skilled Network Consulting Engineer (NCE) to design and implement next-generation VXLAN EVPN-based data center networks that support hypercomputing and AI infrastructure. This role is central to enabling the high-performance, low-latency environments required for multi-GPU clusters, distributed training pipelines, and scalable AI workloads.
The ideal candidate will bring deep expertise in VXLAN, BGP, EVPN, RoCEv2, and fabric-based networking, along with a solid understanding of the networking demands of large-scale AI and HPC environments.
Key Responsibilities:
•Design, deploy, and manage VXLAN EVPN-based fabric networks supporting AI and high-performance computing workloads.
•Build scalable spine-leaf architectures using Cisco Nexus 9000 or Arista switches, ensuring efficient Layer 2/Layer 3 segmentation and workload mobility.
•Optimize network performance for RoCEv2, GPUDirect, and low-latency traffic flows, ensuring maximum throughput and minimal jitter for GPU-accelerated clusters.
•Configure underlay and overlay protocols, including BGP, OSPF, and IS-IS, and troubleshoot connectivity and control-plane issues.
•Integrate data center fabric with virtualization platforms such as VMware NSX, KVM, and Hyper-V, as well as Kubernetes-based AI infrastructure.
•Collaborate with compute, storage, and DevOps teams to ensure end-to-end performance across AI training, fine-tuning, inference, and ETL pipelines.
•Drive network automation efforts using tools such as Python, Ansible, and Terraform, enabling Infrastructure-as-Code (IaC) for repeatable, scalable deployments.
•Participate in performance tuning, benchmarking, and capacity planning for evolving AI workloads and future-proof infrastructure designs.
Required Skills & Expertise:
•Proven experience designing and operating VXLAN EVPN-based data center networks.
•Strong knowledge of Cisco ACI, Nexus 9000, or Arista EOS platforms.
•Expertise in data center routing and switching, including BGP, OSPF, IS-IS, multicast, and FabricPath.
•Deep understanding of AI-specific networking needs, including RoCEv2, DCB, PFC, and RDMA optimization.
•Hands-on experience with automation and orchestration tools such as Python, Ansible, and Terraform.
•Familiarity with GPU-accelerated workloads and distributed systems in AI/ML and HPC environments.
Preferred Qualifications & Certifications:
•CCNP or CCIE Data Center (highly preferred)
•Cisco Certified Specialist – Enterprise Core or Data Center Core
•Additional certifications in NVIDIA Networking (Cumulus/Spectrum) or Arista ACE are a plus
•Experience with hybrid and multi-cloud networking for AI clusters is desirable.
Please send your CV to jobs@inetsoftwaresolutons.co.uk