Permanent
Posted on 12 August 25 by Laura Bell
Powered by Tracker
Coba is currently recruiting on behalf of one of our long-established clients—a leading global trading and investment firm. We are seeking a talented Full-Stack LLM Application Engineer to help design, build, and scale AI-powered chatbot applications that support high-performance trading, middle office, and back office operations.
This is a hands-on engineering role where you’ll take full ownership of the application stack—from intuitive chat interfaces to secure APIs and backend integrations. You’ll collaborate closely with data scientists and platform engineers to deliver real-time, secure, and scalable AI services in a dynamic, enterprise-grade environment.
Chat UI Development:
Build responsive, user-friendly chat interfaces using React and TypeScript, supporting both desktop and mobile. Implement secure authentication via OAuth/OIDC and support chat and semantic search patterns.
API & Model Serving:
Serve LLMs through FastAPI-based REST/gRPC endpoints. Deploy models using Triton, Ray Serve, or SageMaker with GPU-aware autoscaling.
Security & Access Control:
Enforce fine-grained access controls using OAuth claims. Implement prompt validation, rate limiting, and detailed audit logging.
Legacy System Integration:
Develop low-latency interfaces to connect with existing systems built in Java/.NET, as well as protocols like FIX, Kafka, and message queues.
Monitoring & Cost Optimization:
Set up dashboards to monitor latency, model accuracy, and usage metrics. Manage CI/CD pipelines using GitLab for safe, blue-green deployments.
Infrastructure Automation:
Contribute to Terraform modules for provisioning and managing AWS infrastructure, including EKS clusters, API Gateways, and Lambda functions.
Ready to shape the future of AI in financial services?
Apply now to join a team that’s pushing the boundaries of innovation in one of the world’s most dynamic industries.