PubMatic
Technology
SeniorSoftwareEngineer-DataAnalytics
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Software Engineer - Data Analytics at PubMatic. Skills: Scala, Spark, Kafka, Hadoop, GenAI. Build, design, and implement highly scalable, fault-tolerant big data platform; Develop backend services; Build and maintain Big Data pipelines; Architect and implement real-time data processing workflows; Design and develop GenAI-powered agents; Integrate LLMs into existing services; Manage end-to-end GenAI workflows; Work closely with cross-functional teams; Participate in Agile/Scrum processes; ”
What You'll Achieve.
Enhance data analytics capabilities, enable platform to handle complex information retrieval, contextual understanding, and adaptive interactions, ultimately improving data-driven insights and analytical workflows.
Industry & Context.
data structure; algorithm design; architectural specifications; distributed systems
We don’t just value diversity, we promote and celebrate it. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
What They're Looking For.
Must Have
1-5 plus years of coding experience in Java and backend development. Solid computer science fundamentals, including data structure and algorithm design, and creation of architectural specifications. Expertise in developing the Implementation of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews. Hands-on experience with Big Data tools and systems like Scala Spark, Kafka, Hadoop, Snowflake. Proven expertise in building GenAI applications, including: LLM integration (OpenAI, Anthropic, Cohere, etc. ) LangChain, or similar agent orchestration libraries Prompt engineering, embedding, and retrieval-based generation (RAG) Experience in developing and deploying scalable, production-grade AI or data systems. Ability to lead end-to-end feature development and debug distributed systems. Experience in developing and delivering large-scale big data pipelines, real-time systems.
What You'll Do.
Build, design, and implement highly scalable, fault-tolerant big data platform
Develop backend services
Build and maintain Big Data pipelines
Architect and implement real-time data processing workflows
Design and develop GenAI-powered agents
Integrate LLMs into existing services
Manage end-to-end GenAI workflows
Work closely with cross-functional teams
Participate in Agile/Scrum processes
Discuss software features with product managers
Support customer issues
Perform code and design reviews.
How You'll Work.
Team & Collaboration
Working closely with cross-functional teams on improving the availability and scalability of large data platforms and the functionality of PubMatic software.
Communication Scope
discuss with product managers; customer issues
Process & Methodology
Agile/Scrum processes, sprint planning, sprint retrospective, backlog grooming, user story management, work item prioritization
Full Job Description
About the Role PubMatic is seeking Data Analytics-focused Senior / Software Engineers with expertise in building and optimizing AI agents, including strong skills in Hadoop, Spark, Scala, Kafka, Spark Streaming, and cloud-based solutions, with proficiency in programming languages such as Scala and Python. You will be responsible for developing advanced AI agents that enhance our data analytics capabilities, enabling our platform to handle complex information retrieval, contextual understanding, and adaptive interactions, ultimately improving our data-driven insights and analytical workflows. What You'll Do: Build, design, and implement our highly scalable, fault-tolerant big data platform to process terabytes of data and provide customers with in-depth analytics. Develop backend services using Java, REST APIs, JDBC, and AWS. Build and maintain Big Data pipelines using technologies like Spark, Hadoop, Kafka, and Snowflake. Architect and implement real-time data processing workflows and automation frameworks. Design and develop GenAI-powered agents for analytics, operations, and data enrichment use cases using frameworks like LangChain, LlamaIndex, or custom orchestration systems. Integrate LLMs (e.g., OpenAI, Claude, Mistral) into existing services for query understanding, summarization, and decision support. Manage end-to-end GenAI workflows including prompt engineering, fine-tuning, vector embeddings, and retrieval-augmented generation (RAG). Working closely with cross-functional teams on improving the availability and scalability of large data platforms and the functionality of PubMatic software. Participate in Agile/Scrum processes such as sprint planning, sprint retrospective, backlog grooming, user story management, and work item prioritization. Frequently discuss with product managers about the software features to include in the PubMatic Data Analytics platform. Support customer issues over email or JIRA (bug tracking system), provide updates, and patches to
Applying for this Senior Software Engineer - Data Analytics role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about PubMatic?
Real rants from real employees. Read before you apply.