Source Information Services Limited logo
8 days ago
Full-time
On-site
London, London, City of, United Kingdom
Data Engineer

Purpose

Our core asset is our data, and we are looking for a specialist who can not only maintain our high-standard data infrastructure while our Lead Data Engineer is on paternity leave but also accelerate our evolution into an AI-first organization.Ā Ā 

Ā 

This role is a unique hybrid of stability and innovation. You will ensure our existing pipelinesĀ remainĀ robust while leading the charge on AI improvements to our internal operations, systems, and client-facing products.Ā 

Ā 

You will be key to helping us extract new insights, provide deeper analysis, and enable AI-driven self-service capabilities for ourĀ internal and external users.



KeyĀ facetsĀ of this role

  • AI Integration & Innovation: Design and deploy AI-driven features to automate internal operations and enhance our qualitative/quantitative research assets.Ā 
  • Vector Infrastructure: Build andĀ maintainĀ vector databases and RAG (Retrieval-Augmented Generation) pipelines to unlock the value of our unstructured data.Ā 
  • Pipeline Evolution: Transform existing ETL/ELT processes into AI-ready pipelines, ensuring data quality for machine learning training and inference.Ā 
  • System Maintenance: Provide interim stewardship of our core data platform, ensuring uptime and performance while the Lead Data Engineer is away.Ā 
  • Technical Mentorship: Act as the internal subject matter expert, upskilling the broader team on MLOPs and AI data best practices.Ā 
  • Operational AI: Implement agentic workflows or automated insights to turn raw data into "AI-driven self-service" capabilities for our global clients.



The type of person we need in this role

This role can only be done effectively by someone who:Ā 

  1. Experience: 4+ years in Data Engineering, with at least 2 years focused on AI/ML implementation (LLMs, NLP, or predictiveĀ modeling).Ā 
  2. AI Toolkit: Proven experience with Vector Databases (e.g.,Ā OpenSearch,Ā CosmosDB, Milvus) and frameworks likeĀ LangChainĀ orĀ LlamaIndex.Ā 
  3. Core Engineering: DeepĀ proficiencyĀ in Python and PostgreSQL.Ā 
  4. Big Data & Ops: Hands-on experience with Apache Spark (PySpark) and workflow orchestration (e.g., Airflow, Prefect, orĀ Dagster).Ā 
  5. Cloud & Warehouse: Extensive experience with a major cloud provider (AWS/Azure/GCP) and modern warehouses like Snowflake, Redshift, orĀ BigQuery.Ā 
  6. DevOps Mindset: Proficient with Git,Ā CI/CDĀ and the operationalisation of ML models (MLOps).Ā 
  7. Adaptability: The ability to step into a leadership gap, manage existing priorities, and pivot quickly toward innovation.Ā 



The qualities we’re looking for

  • Problem-Solver:Ā A proactive and analytical mindset, with the ability to diagnose and solve complex data andĀ AI/ML infrastructure challenges.Ā Ā 
  • Collaborative & Enabling:Ā Excellent communication and interpersonal skills, withĀ a strong desireĀ to teach, mentor, and shareĀ expertise effectively with Data Analysts, the Senior Data Engineer, and other stakeholders.
  • Detail-Oriented:Ā Meticulous attention to data quality, integrity, and pipeline robustness.
  • Adaptable:Ā Eagerness to learnĀ new technologiesĀ and adapt to evolving ML/AI landscapes.Ā Ā 
    • Impact-Driven: A desire to contribute directly to the success of data-driven products and business outcomes, particularly in enabling new insights and self-service capabilities.



    What we offerĀ 

    • Strong professional development and continued learning.Ā 
    • Hybrid work environmentĀ (2 days minimum in our London office)Ā with core hours and time flexibility.Ā 
    • Enhanced pension contributionsĀ 
    • Annual profit share schemeĀ 
    • 28 days annual leaveĀ 
    • Learning and development cultureĀ 
    • Health helplinesĀ 
    • Enhanced parental leave.Ā 
    • Cycle to work scheme.Ā 
    • Death in service insuranceĀ 



    AboutĀ us

    Source is a research-led advisory firm thatĀ helpsĀ the world’s largest professional services firms make their most important decisions.Ā Ā 

    With a wealth of independent insight, knowledge, and experience in the industry, Source delivers clear-cut direction that gives firms and their leaders the confidence to act.Ā Ā 

    Ā 

    As ourĀ AI &Ā Data Engineer, you will be instrumental in enabling us to build robust, deep, and valuable data through advanced analytics and AI-driven capabilities.Ā 



    DetailedĀ roleĀ 

    • BuildĀ scalable data pipelines for ML and AI applications.Ā Implement robust data ingestion strategies from diverse sources (e.g., databases, APIs, streaming services) andĀ participateĀ in designing efficient data transformation pipelines to prepare our qualitative and quantitative data for machine learning and AI consumption, setting standards for the team.Ā 
    • Champion strategies for preparing diverse data into AI-ready features.Ā Collaborate with Data Analysts to understand data needs for new insights and self-service tools, then lead the design and structuring of data appropriately for various AI and ML applications, guiding other engineers in these practices.Ā 
    • Steer the cloud data platform's evolution to enhance AI/ML capabilities.Ā Participate in optimising the usage of cloud-native data and ML services to ensure cost-efficiency, scalability, and high availability of the data platform, with a focus on AI/ML readiness, and advise the team on technology choices.Ā 
    • Optimise data processing and ML pipelines for efficiency and scale.Ā ProactivelyĀ identifyĀ and resolve performance bottlenecks in data pipelines and ML workloads, ensuringĀ optimalĀ system efficiency, and sharing techniques for performance tuning with the team.Ā 
    • Be the go-to expert for ML/AI data engineering best practices.Ā ActivelyĀ participateĀ in technical discussions, conduct code reviews, and lead knowledge sharing sessions with Data Analysts, the Senior Data Engineer, and other engineering teams to foster a data-driven culture and elevate ML/AI understanding.Ā 



    Diversity & Inclusion

    At Source, we are committed to encouraging equality, diversity, and inclusion among our workforce, andĀ eliminatingĀ unlawful discrimination.Ā 


    We are determined to ensure that no applicant or employee receives less favourable treatment on the grounds of gender reassignment, age, disability, religion or belief, sex, sexual orientation, marital status, or race, or is disadvantaged by conditions or requirements which cannot be shown to be justifiable.Ā 


    The aim is for our workforce to be truly representative of all sections of society and our customers, and for each employee to feel respected and able to give their best.