Senior Data/Software Engineer

Full-Time
Remote

Job Description:

About the client:

The client has built a chat-based shopping agent exclusively for fashion. It is designed to redefine how people search for and discover fashion, offering a personalised, conversational experience powered by advanced AI and natural language understanding.

About the role:
As a Data Engineer, you will be a foundational member of the team, responsible for designing and building the entire data ecosystem that fuels our AI Personal Stylist. This is a unique opportunity to solve complex technical challenges while directly shaping a product that will revolutionise how people shop online.

Responsibilities:

Design, build, and optimize scalable, parallel data processing pipelines on Google Cloud to handle massive volumes of offline data.
Implement and manage large-scale LLM batch inference jobs, processing millions of data points to enrich our product catalog with sophisticated, AI-generated attributes.
Architect and own the data infrastructure for our Fashion Knowledge Graph, leveraging BigQuery and parallel data processing frameworks.
Develop and maintain robust feature generation pipelines to craft high-quality signals for both the training and inference of our machine learning models.
Orchestrate complex workflows of data processing jobs, implementing robust monitoring, alerting, and data quality validation systems to ensure reliability and trust in our data.
Collaborate closely with data science and machine learning teams to understand data requirements and deliver production-grade data solutions.
Champion engineering best practices, including writing clean, maintainable Python and SQL, and drive a culture of high-quality data and operational excellence.

Required Skills & Experience:

Data engineering background with strong generalist skills; comfortable writing code with AI assistance (while still owning it) in various programming languages
Experience handling large volumes of data, including data warehouses and data lakes (Apache Spark, Polars, Snowflake, etc.)
Hands-on experience with workflow orchestration tools like Airflow, Argo, Azkaban or Kubeflow.
Strong understanding of SQL
Comfortable with CI/CD and testing
Understanding of algorithmic complexity (Big O notation)
Deep understanding of the web stack, useful for scraping and for owning problems across the stack
Strong understanding of the AI stack: what is possible, what is not, and what is expensive
High autonomy, comfortable with rapidly shifting priorities in a start-up environment

Nice-to-have experience:

Go / Java experience
Scrapy experience
Elasticsearch experience (or more generally custom search engine experience)
Data science background (ML experiments, feature extraction, etc.)
Google Cloud experience

Additional information:

- Work with some of the most dynamic US tech companies, building and iterating on new features and platforms.

- Long-term projects with real technical challenges.

- Fully remote work with flexible hours.

- Collaboration flexibility: We work with B2B (PFA/SRL) contracts.

- 30 paid days off per year.

- We provide equipment as needed (laptop, desktop, etc.).

- Continuous learning: We sponsor career-improving courses, seminars, and certifications.

- Opportunity for annual business visits to the US, depending on project needs.

Get picky and choose a career that matches your mindset and lifestyle. Team up with a company that encourages you to do more and gives you the flexibility you need!