Experienced Full Stack Data Engineer – Information Pipeline Development and Management for OpenAI
Posted 2025-10-26
Remote, USA
Full Time
Immediate Start
About OpenAI and the Role

At OpenAI, we're pushing the boundaries of artificial intelligence to create a safer and more beneficial future for all of humanity. As a leading AI research and deployment organization, we're committed to ensuring that universally beneficial artificial intelligence is developed with security and human needs at its core. We believe that AI can help people tackle massive global challenges, and we're dedicated to making the potential benefits of AI widely shared.

Job Summary

We're seeking an experienced Full Stack Data Engineer to join our team and lead the development and management of our information pipelines. As a key member of the team, you'll play a critical role in building and maintaining our data pipelines, ensuring seamless data integration, and supporting business decisions with data-driven insights. If you're passionate about working with data and eager to make a significant impact, we'd love to hear from you.

Key Responsibilities

- Design, build, and manage our information pipelines, ensuring all client event data is reliably integrated into our data warehouse.
- Create canonical datasets to track key product metrics, including customer growth, engagement, and revenue.
- Collaborate with cross-functional teams, including Foundation, Data Science, Product, Marketing, Finance, and Research, to understand their data needs and provide solutions.
- Implement robust, fault-tolerant frameworks for data ingestion and processing.
- Participate in data design and architecture decisions, bringing your expertise and data to bear.
- Ensure the security, integrity, and compliance of data according to industry and company standards.

Requirements and Qualifications

To succeed in this role, you'll need:

- Proficiency in at least one programming language commonly used in data engineering, such as Python, Scala, or Java.
- Experience with distributed processing technologies and frameworks, such as Hadoop and Flink, and with distributed storage systems (e.g., HDFS, S3).
- Knowledge of ETL schedulers such as Airflow, Dagster, Prefect, or similar frameworks.
- Strong understanding of data storage and ability to write, troubleshoot, and optimize data storage code.

What We Offer

We're committed to providing a comprehensive compensation package that includes:

- Competitive salary of $28/hour.
- Generous benefits and perks, including:
  - Medical, dental, and vision insurance for yourself and your loved ones.
  - Mental health and wellness support.
  - 401(k) plan with 4% matching.
  - Unlimited time off and 18+ company events per year.
  - Paid parental leave (20 weeks) and family-planning support.

Why Join OpenAI?

At OpenAI, we're passionate about creating a safer and more beneficial future for all of humanity. As a member of our team, you'll have the opportunity to work with a talented group of individuals who are passionate about AI and committed to making a positive impact.

How to Apply

If you're excited about the opportunity to join our team and contribute to the development of our information pipelines, please submit your application through our website. We can't wait to hear from you!