Paperpile

Senior Backend Engineer (Data, Search, Infrastructure)

  • Paperpile
  • Remote, Austria
Apply Now

Job Description

Paperpile runs on data at scale, with a literature database of 250M+ academic papers and a growing body of user data accumulated over more than a decade. You'll work across the systems that ingest, process, store, and serve this data reliably: building pipelines, optimizing search, handling PDFs at scale, and exposing clean APIs.

Requirements

  • Strong backend engineering background with experience building and operating data-heavy systems in production.
  • Experience deploying and operating services on AWS.
  • Experience designing and maintaining data ingestion pipelines handling messy, heterogeneous sources. Comfortable with web scraping and working with third-party data sources and APIs.
  • Familiarity with Node.js and TypeScript. It’s fine if you come from a different background, such as Java or Python, but you should be comfortable working in this environment.
  • High standards for data quality. You think carefully about correctness, deduplication, and consistency.
  • Solid understanding of full-text search systems including indexing strategy, relevance tuning, and query optimization.
  • Proficient in building reliable REST APIs.

Additional useful experience:

  • Familiarity with academic publishing formats and data sources (PubMed, Crossref, arXiv, …).
  • Experience with PDF processing pipelines (extraction, transformation, storage, and delivery at scale).
  • Experience with LLM-based document processing or ML pipelines for extracting structured data from unstructured text.
  • Large-scale web crawling and scraping.

Benefits

  • Base compensation of €60,000–€90,000, depending on experience.
  • Bonus/equity program.
  • 4 weeks paid vacation + local holidays.
  • We sponsor a co-working space in your city.
  • Learn and grow. Try out new things. We sponsor relevant courses, seminars, and conferences.