Database Engineer at DataRobot
Company Overview:
DataRobot is the AI Cloud leader, delivering a unified platform for all users, data types, and environments to accelerate the delivery of AI to production for every organization. A third of the Fortune 50 currently trusts our Augmented Intelligence platform. We are growing and scaling rapidly with a massive market opportunity in sight. As a DataRobot employee, you will be part of a company whose mission is clear - bring a world class Machine Learning platform and change outcomes for organizations worldwide. We’re at the forefront of the AI revolution, from banking to insurance to healthcare. And we need other revolutionaries to join us.
Our Service Reliability Database team supports all of the services that comprise DataRobot, allowing services to run, seamlessly scale, and tolerate outages across 3 major clouds. We are building a multi-tenanted Kubernetes environment for all DataRobot services from the ground up using cutting edge cloud native technologies and approaches - fully declarative deployment, advanced policy engines, zero trust security, comprehensive auto scaling, and best in class observability with distributed tracing.
Position Overview:
Datarobot is recruiting a Database Engineer who can bring in creativity, knowledge, expertise and great enthusiasm to join our Service Reliability Database team. This critical role is an exciting opportunity that will involve administering, automating, and troubleshooting our cloud database systems which is a critical component of our company. This role will give you a perfect opportunity to hone your skills and have exposure to a wide range of database technologies.
Essential Functions:
Design, implement and maintain database systems that are reliable, scalable and secure to support AI/ML model deployments and statistics
Support, administer and manage production database systems by proactively monitoring performance, security and availability
Instrument database backup and recovery process automation
Provide thoughtful and responsive management support for database systems and application teams
Perform upgrades to latest releases while working closely with different teams and project management
Resolve critical issues/ deploy and maintain database clusters including configuration enhancements and database tuning
Solve non-routine problems and determine the best practices and strategies for recurring database issues.
Collaborate with different application teams to implement database changes and make sure all issues, incidents and projects are addressed in timely manner
Periodically review guidelines and documentation for improvements and assist in implementing standards and streamline process
Participate in on-call database issues and incident response to address critical issues and outages
Develop scripts and tools to automate database administration tasks
Bring strong comprehensive problem-solving skills to identify and resolve problems
Collaborate and work as a part of geographically dispersed team to ensure data availability, accessibility and security
Desired skills/ Experience:
Over 5+ years of experience managing large scale database systems mainly PostgreSQL and MongoDB in SaaS
Deep knowledge and understanding of advanced PostgreSQL HA concepts, architecture and internals
Proficient with MongoDB replication, sharding and data modeling
2+ years of experience using any cloud services stack such as AWS, GCP and Azure
Experience in database reliability domain with strong grasp on database performance, scaling and tuning concepts
Work with different domains and developers to write optimized SQL
Automate repeatable tasks using scripting language such as bash, python and ansible
Experience with container orchestration
Experience with data migration services to AWS
Nice to have:
Hands on experience in using version control systems, configuration management tools and IaaC
Experience using database tools such as pgadmin, pgsql, robo3t, studio3t, MongoDB Ops Manager and Mongo mirror
The talent and dedication of our employees are at the core of DataRobot’s journey to be an iconic company. We strive to attract and retain the best talent by providing competitive pay and benefits with our employees’ well-being at the core. Here’s what your benefits package may include depending on your location and local legal requirements: Medical, Dental & Vision Insurance, Flexible Time Off Program, Paid Holidays, Paid Parental Leave, Global Employee Assistance Program (EAP) and more!
Our Core Values:
Be You - We are co-contributors to our culture committed to diversity, inclusion, equity and a sense of belonging throughout our global community. We value each person as their authentic self and welcome honest and respectful dialogue as an avenue to greatness.
Dream Big - We don’t settle for the status quo. We always pursue the development of the next version of ourselves. We believe investment in the growth and resilience of our people is the foundation of building an iconic company.
Be Data-Driven - We believe data-driven decision making enables innovation and execution at velocity while ensuring repeatable success. We take on the toughest challenges as we transform the world with our platform and AI.
Better Together - We celebrate success and learn from failure as one team. We foster an aligned, collaboration-first workplace by building trust through transparency, integrity and accountability.
Champion Customers - We empower our customers and our communities by shaping the future of trusted, augmented intelligence in order to solve their most difficult problems. We are humble, not complacent. We succeed when they succeed.