Data Engineer
What has been the effect of disinformation on the world?
Blackbird.AI creates leading-edge AI software to provide critical real-time insights to provide our clients with a deep understanding of ongoing disruptive narratives, their motives, and overall digital noise. We are united by our dedication to our mission. We believe that we have a responsibility to society and that our service is vitally needed by organizations and individuals to create an empowered and critical thinking society.
If this mission resonates with you, we'd love to hear from you.
The Opportunity:
Get ready to join a small but growing team of highly talented engineers and leaders, building exciting AI-driven services and technologies. As a Data Engineer for Blackbird.AI, you will own the pipeline optimization for a real-time streaming cloud-hosted analytics platform that spans data collection and analysis, and serves results to a user dashboard for interactive visual exploration. Our position requires a breadth of experience with database technologies, especially the engineering of horizontally scalable solutions for big data.
Responsibilities:
Writes ETL processes to support ingestion and normalization of a wide variety of social media, news, and web scrape formats
Designs database systems and develops tools for query and analytic processing, including for streaming real-time applications
Performs analysis and comparative empirical studies to evaluate performance tradeoffs with respect to scaling (e.g., cost vs throughput/latency)
Develops, manages and owns the database architecture for a real-time streaming cloud hosted analytics platform, spanning data collection, analytics and user management
Owns build automation, continuous integration, deployment and performance optimization in compliance with our security requirements
Requirements
Must Have:
BS degree in Computer Science or equivalent
Demonstrated product success with deployment in the cloud and SaaS model; proven capability to develop processing pipeline for platforms that are optimized for streaming analytics applications and that are cloud agnostic (Kubernetes, dockerized solutions)
Expert level capable on PostgreSQL, Neo4j (graph), ElasticSearch, MongoDB, Redis, Druid, with other NoSQL and graph DBs helpful
Experienced with horizontal scaling of databases
Experienced with Kafka and Airflow; expert in applying tools for runtime profiling to optimize throughput and latency and establish comparative performance benchmarks
Capable in build automation, continuous integration and deployment (CI/CD) tools, e.g. Webpack, Buddy or using Jenkins + docker
Expert level Python code development
Experience working with distributed teams
Helpful to Have:
Technical background in Artificial Intelligence (AI) and Machine Learning (ML)
Experience designing and implementing interactive query-driven man-machine intelligence systems
Solid skills in Java
Benefits
Health Care Plan (Medical, Dental & Vision)
Paid Time Off (Vacation, Sick & Public Holidays)
Work From Home
Stock Option Plan
Exciting career development prospects, to grow into leadership roles