Sr. Technical Solutions Architect

Date: Jun 2, 2026

Location: Chicago, IL, US Dallas, TX, US MA, US MD, US Denver, CO, US FL, US TX, US

Company: Softchoice

Why you’ll love Softchoice:

We are a software-focused IT solutions and services provider that equips organizations to be agile and innovative, and for their people to be engaged, connected, and creative at work. That means moving them to the cloud, helping them build the workplace of tomorrow, and enabling them to make smarter decisions about their technology. By doing these things we help them create success for their customers and their people.

We stand proudly for our people and support their success through career development and advancement. We are recognized and respected for our culture of inclusion and belonging, continuously striving to do what’s good for our people and communities. 
 

The impact you'll have:

 

We are seeking a Senior Technical Solutions Architect — AI to serve as a hands-on, platform-agnostic technical architect for our strategic AI engagements. This person sits at the intersection of customer strategy, applied AI engineering, and modern software delivery. They translate ambiguous business problems into working prototypes, scalable reference architectures, and production-grade solutions across public-cloud hyperscaler AI platforms and sovereign (on-premise / private) AI environments.

The ideal candidate is equally comfortable whiteboarding an agentic architecture with a CIO, writing the proof-of-concept code that proves it works, and guiding a client engineering team through the secure path to production. They are vendor-fluent but vendor-neutral — recommending the right tool for the workload, the data, the risk profile, and the budget.

 

What you'll do

 

Solutioning & Architecture

  • Design end-to-end AI solutions spanning Generative AI (RAG, CAG, GraphRAG, fine-tuning, model distillation) and agentic AI (tool-using agents, multi-agent orchestration, MCP-based integrations).
  • Architect across all major hyperscaler AI stacks — AWS (Bedrock, SageMaker, Q), Microsoft Azure (Azure AI Foundry, Azure OpenAI), and Google Cloud (Vertex AI, Gemini) — and recommend the right platform per workload rather than defaulting to a single provider.
  • Architect sovereign / on-premise AI solutions using stacks such as NVIDIA AI Enterprise (NIM, NeMo, Blueprints), Dell AI Factory, HPE Private Cloud AI, Red Hat OpenShift AI, Run:ai, and open-source model serving (vLLM, TGI, Ollama) — for clients with data residency, regulatory, IP, or air-gapped requirements.
  • Develop reusable reference architectures, decision frameworks, and trade-off analyses (cost, latency, accuracy, governance, sovereignty) that scale across the practice.

 

Rapid Prototyping

  • Build working prototypes — not just slides. Translate client problem statements into functional demos and pilots in days, not months.
  • Stand up RAG, CAG, and agentic workflows quickly using frameworks such as LangChain / LangGraph, LlamaIndex, CrewAI, AutoGen, Semantic Kernel, and MCP-compliant agent toolchains.
  • Integrate vector stores (Pinecone, Weaviate, Milvus, Chroma, pgvector, OpenSearch), graph stores (Neo4j, Neptune), and hybrid retrieval pipelines as the use case demands.
  • Run rigorous, repeatable evals on prototypes (groundedness, faithfulness, latency, cost-per-task, tool-use accuracy) so recommendations are evidence-based.

 

AI-Native Engineering & Modernization

  • Lead solutioning for AI-native software engineering engagements: AI-assisted development, code refactoring at scale, tech debt burndown, legacy modernization, test generation, and documentation regeneration.
  • Architect Secure SDLC (SSDLC) practices into every AI-built or AI-assisted codebase — threat modeling, SAST/DAST integration, SBOM generation, dependency hygiene, secrets management, and supply-chain security.
  • Advise clients on integrating AI coding agents (Claude Code, Cursor, GitHub Copilot Workspace, Devin, and others) into their existing SDLC and DevSecOps toolchains without compromising guardrails.
  • Define MLOps / LLMOps / AgentOps patterns: model and prompt versioning, evaluation pipelines, observability (traces, token usage, drift), guardrails, and human-in-the-loop review.

 

AI Security

  • Conduct AI-specific threat modeling for every solution — covering adversarial inputs, prompt injection, jailbreaking, model inversion, training data extraction, and indirect injection via tool outputs or retrieved documents — and translate findings into concrete mitigations in the architecture.
  • Design multi-layer guardrail architectures: input sanitization and intent classification, output filtering (PII redaction, toxicity screening, factual grounding checks), content safety policies, and fallback / refusal handling — covering both hosted API models and self-hosted open-weight deployments.
  • Enforce least-privilege access control for agentic systems: scope tool permissions, define agent authorization boundaries, audit and log all tool invocations, and ensure agents cannot escalate privileges or exfiltrate data outside approved boundaries.
  • Maintain end-to-end AI supply chain security: vet third-party model weights and datasets for backdoors or poisoning, validate fine-tuned model integrity, enforce cryptographic signing of model artifacts, and apply model cards and datasheets as governance artifacts.
  • Align AI solutions to applicable compliance frameworks — NIST AI RMF, OWASP LLM Top 10, ISO/IEC 42001, EU AI Act, and relevant sector-specific regulations — and produce the risk documentation, impact assessments, and audit trails clients need to satisfy internal governance and external regulators.

 

Client Engagement & Enablement

  • Serve as the senior technical voice in client conversations — from executive briefings through deep technical design sessions.
  • Partner with sales, delivery, and practice leadership to scope statements of work, estimate effort, and de-risk delivery.
  • Mentor architects, engineers, and consultants across the broader AI practice; raise the technical bar through code reviews, internal enablement, and reusable assets.
  • Stay ahead of the field — evaluate emerging models, frameworks, and protocols (e.g., MCP, A2A, ACP, new agent frameworks, new sovereign AI stacks) and bring well-reasoned points of view back to the practice.

 

What you'll bring to the table:

 

  • 8+ years of progressive experience in software engineering, solutions / Enterprise architecture, or applied AI/ML, with at least 2+ years in a hands-on Generative AI or agentic AI role.
  • Demonstrated ability to rapidly prototype AI solutions and ship working code — not just designs or documents.
  • Deep, hands-on experience with at least one of the three major hyperscaler AI platforms (AWS, Azure, GCP) and a working understanding of the second and third.
  • Production experience designing and shipping RAG and/or agentic systems, including practical familiarity with chunking strategies, embedding model selection, retrieval evaluation, and orchestration patterns.
  • Working knowledge of MCP (Model Context Protocol) and modern agent-tool integration patterns; ability to design MCP servers and clients, and to reason about when MCP is the right abstraction versus alternatives.
  • Strong understanding of CAG (Cache-Augmented Generation), RAG variants (naive, hybrid, GraphRAG, agentic RAG), and the trade-offs between each.
  • Proficiency in Python; comfort in at least one additional language (TypeScript/JavaScript, Go, Java, or C#).
  • Experience integrating with enterprise systems: REST/GraphQL APIs, event streams (Kafka, EventBridge), identity (OIDC, SAML, OAuth2), and enterprise data platforms (Snowflake, Databricks, Fabric, BigQuery).
  • Excellent written and verbal communication; able to move fluidly between executive narrative and engineering whiteboard.
  • Foundational fluency in AI security concepts: able to identify and articulate risks such as prompt injection, data poisoning, model extraction, and inference-time attacks, and to reason about appropriate mitigations for each in the context of a given architecture and risk tolerance.

 

Strongly Preferred

  • Software development background with real production experience across the SDLC and Secure SDLC (SSDLC) — including CI/CD, infrastructure as code (Terraform, Pulumi, Bicep), containers and Kubernetes, and DevSecOps tooling.
  • Experience leading code refactoring, technical debt remediation, and legacy modernization programs — ideally with AI-assisted approaches.
  • Experience designing sovereign / on-premise AI deployments: NVIDIA NIM / NeMo, OpenShift AI, Run:ai, vLLM at scale, GPU capacity planning, and on-prem vector / graph stores.
  • Background in security and governance: prompt injection defense, output filtering, data loss prevention, model risk management, NIST AI RMF, ISO/IEC 42001, and EU AI Act readiness; familiarity with the OWASP LLM Top 10, adversarial ML attack taxonomies (MITRE ATLAS), and red-teaming / evaluation techniques for LLMs; experience translating these frameworks into practical control designs rather than checkbox compliance.
  • Experience fine-tuning, distilling, or post-training open-weight models (Llama, Mistral, Qwen, Gemma) for enterprise use cases.
  • Industry experience in regulated verticals (financial services, healthcare, public sector, defense) where sovereignty and compliance are non-negotiable.
  • Relevant certifications (AWS / Azure / GCP AI specialty, CKA/CKAD, CISSP, NVIDIA-certified) — useful, but capability is weighted more heavily than credentials.

 

Education

Bachelor's degree in Computer Science, Engineering, Mathematics, or a related technical field, or equivalent demonstrable experience. Advanced degree is welcomed but not required.

 

What Sets a Great Candidate Apart

  • A pragmatic, opinionated point of view on when not to use GenAI or agents — and the judgment to steer clients toward the right answer even when it isn't the flashy one.
  • Curiosity that runs ahead of the market: already experimenting with the next protocol, the next model, the next orchestration pattern before clients ask.
  • Comfort with ambiguity — the ability to walk into a half-formed problem, frame it, prototype against it, and leave the client with a clearer path forward than they had that morning.

 

Location & Travel

Remote-friendly with periodic travel to client sites and internal events (estimated 0–15%).

 

Compensation:

 

Corporate/Pay Mix: A reasonable estimate of the current base pay range for this position is $124,320 to $155,400 annually + 30% Target Incentives

 

Actual salary will be based on a variety of factors, including location, experience, skill set, education, and related certification. The range for this position in other geographic locations may differ.

 

Softchoice offers a comprehensive and competitive benefit plan to all full-time employees, which includes:

  • Health and Wellbeing: Medical, Dental, and Vision Care, Flexible Spending Account, Employee Assistance Program
  • Financial Benefits: 401k Plan with Company Matching, Life and Disability Insurance
  • Paid Time Off: PTO and Sick Leave (starting at 20 days per year), Holidays, Parental Leave, Volunteer Days, Bereavement Leave
  • Additional Perks: Employee Discount Program

 


Not sure if you qualify? Think about applying anyway:

 

We understand that not everyone brings 100% of the skills and experience for the role.

At Softchoice, we offer opportunities to a diverse group including those with a variety of workplace experiences and backgrounds.  Whether you are new to corporate tech, returning to work after a gap in employment, or looking to transition and take the next step in your career, we are excited to learn more about you and encourage you to apply.

 

Why You’ll Love Working Here:

 

  • The People: You’ll thrive in our collaborative environment, surrounded by incredible colleagues who foster support and innovation, driving our collective success
  • High-Performing Culture: At Softchoice, we are dedicated to achieving our goals and committed to success for our customers and each other
  • Flexibility: Plan your workdays in a way that suits you best
  • Award-Winning Workplace: Proudly recognized as a Great Place to Work for 20 consecutive years
  • Inclusive Culture: We are committed to an inclusive culture where every team member can be their authentic self
  • Competitive Benefits: Benefit from competitive perks that start on day one

 

Inclusion & Equal opportunity employment:

 

We are an equal opportunity employer committed to diversity, inclusion & belonging. People seeking employment at Softchoice are considered without regard to any protected category including but not limited to, race, color, religion, national origin, age, sex, marital status, ancestry, disability, veteran status, gender identity, or sexual orientation.

 

Require accommodation? We are ready to help:

 

We are proud to provide interview & employment accommodation during the recruitment and hiring process. If you require any accommodation to apply or interview for a position, please reach out directly to asktalentacquisition@softchoice.com. We are committed to working with you to best meet your needs.

 

Our commitment to your experience:

 

We are committed to the safety of all applicants and team members. With that in mind, we have implemented digital interviewing for everyone. We understand that you may need to interview with distractions around you (such as children or furry friends) and we will be doing the same.

 

Before you start with us, we will conduct a criminal record check, verify your education, and check your references.

 

When you join Softchoice, we will onboard you remotely. Don't worry. It's quick, simple and you'll be connected with your new team in no time.

 

Job Requisition ID: 7483 

EoE/M/F/Vet/Disability  

#LI-RR1


Nearest Major Market: Chicago