Big Data Engineer
Big Data Engineer with a Bachelor's degree in computer science, computer information systems, or information technology, or a combination of education and experience equating to the U.S. equivalent of a Bachelor's degree in one of the aforementioned subjects.
Job duties and Responsibilities:
- Define the end-to-end solution architecture for large-scale technology projects, applying deep technical expertise in distributed processing and real-time, scalable systems.
- Architect, design, and develop Big Data streaming applications that use Redis, a high-performance and highly available NoSQL key-value store, for checkpointing (see the first sketch after this list).
- Design and develop Spark applications in Scala that use DOM/SAX parsers to parse incoming raw string/XML data (see the second sketch after this list).
- Design and develop AWS cloud deployment scripts using AWS CloudFormation templates, Terraform, and Ansible.
- Design, develop, and troubleshoot Hive, Pig, Flume, MongoDB, Sqoop, ZooKeeper, Spark, MapReduce2, YARN, HBase, Kafka, and Storm.
- Fine-tune applications and systems for high performance and higher-volume throughput, and pre-process data using Hive and Pig.
- Translate, load, and present disparate data sets in various formats from sources such as JSON, text files, Kafka queues, and log data (see the third sketch after this list).
- Install and configure Docker images for Telegraf, InfluxDB, Grafana, and Kapacitor on AWS cloud-monitoring EC2 instances.
- Design and develop Kapacitor scripts that deliver alerts as push notifications, SMS, email, and Slack messages.
- Define the technology/Big Data strategy and roadmap for client accounts, and guide implementation of that strategy within projects.
- Apply strong project management skills to deliver complex projects, including effort/time estimation, building a detailed work breakdown structure (WBS), managing the critical path, and using PM tools and platforms.
- Build scalable, engagement-level client processes for faster turnaround and higher accuracy.
- Run regular project reviews and audits to ensure that projects are executed within the guardrails agreed upon by all stakeholders.
- Manage team members to ensure that the project plan is adhered to over the course of the project.
- Manage client stakeholders and their expectations with a regular cadence of weekly meetings and status updates.
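
Redis checkpointing, as mentioned in the first duty above, is nonstandard for Spark, whose built-in checkpointing targets a filesystem; the sketch below is therefore only one plausible interpretation, recording a per-batch progress marker in Redis via the Jedis client. The broker, topic, key name, and paths are hypothetical placeholders, not details from this posting.

    import org.apache.spark.sql.{DataFrame, SparkSession}
    import redis.clients.jedis.Jedis

    object RedisCheckpointJob {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder.appName("RedisCheckpointJob").getOrCreate()

        // Hypothetical Kafka source; broker and topic are placeholders.
        val stream = spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "broker:9092")
          .option("subscribe", "events")
          .load()

        val query = stream.writeStream
          .foreachBatch { (batch: DataFrame, batchId: Long) =>
            batch.write.mode("append").parquet("s3://example-bucket/out/") // hypothetical sink
            // Record the last completed batch id in Redis so restart logic
            // and external monitors can observe streaming progress.
            val jedis = new Jedis("redis-host", 6379) // hypothetical host
            try jedis.set("events:lastCompletedBatch", batchId.toString)
            finally jedis.close()
          }
          .start()

        query.awaitTermination()
      }
    }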
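The Scala XML-parsing duty describes a common Spark pattern. A minimal sketch follows, assuming one XML record per input line and a hypothetical <order id="..."><amount>...</amount></order> schema; the S3 paths and field names are illustrative only. The DOM parser is created inside mapPartitions because DocumentBuilder is neither serializable nor thread-safe.

    import java.io.ByteArrayInputStream
    import javax.xml.parsers.DocumentBuilderFactory
    import org.apache.spark.sql.SparkSession

    object XmlParseJob {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder.appName("XmlParseJob").getOrCreate()
        val raw = spark.sparkContext.textFile("s3://example-bucket/raw-orders/") // hypothetical path

        // Build one DOM parser per partition, on the executors.
        val parsed = raw.mapPartitions { lines =>
          val builder = DocumentBuilderFactory.newInstance().newDocumentBuilder()
          lines.flatMap { line =>
            try {
              val doc = builder.parse(new ByteArrayInputStream(line.getBytes("UTF-8")))
              val root = doc.getDocumentElement
              val id = root.getAttribute("id")
              val amount = root.getElementsByTagName("amount").item(0).getTextContent.toDouble
              Some((id, amount))
            } catch {
              case _: Exception => None // skip malformed records
            }
          }
        }

        parsed.map { case (id, amount) => s"$id,$amount" }
          .saveAsTextFile("s3://example-bucket/parsed-orders/") // hypothetical path
        spark.stop()
      }
    }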
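For the disparate-sources duty, this minimal sketch loads JSON files, plain text/log data, and a Kafka queue into Spark. Paths, topic, and broker address are hypothetical, and the Kafka read assumes the spark-sql-kafka connector is on the classpath.

    import org.apache.spark.sql.SparkSession

    object MultiSourceLoad {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder.appName("MultiSourceLoad").getOrCreate()

        // JSON files: Spark infers the schema from the records.
        val events = spark.read.json("s3://example-bucket/events/") // hypothetical path

        // Plain text / log data: a single string column named "value".
        val logs = spark.read.text("s3://example-bucket/logs/") // hypothetical path

        // Kafka queue: broker and topic are placeholders.
        val stream = spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "broker:9092")
          .option("subscribe", "raw-events")
          .load()
          .selectExpr("CAST(value AS STRING) AS json")

        // Downstream joins/aggregations would unify these data sets.
        spark.stop()
      }
    }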
Skills / Knowledge Required:
- Knowledge of a variety of advanced architectures, tools, and concepts across all layers of the modern distributed technology stack (Hadoop, Spark, Kafka, Cassandra, MongoDB, and similar).
- Knowledge and experience in cloud architectures and cloud tools (Azure/GCP/AWS).
- Knowledge of and experience with deploying and maintaining large-scale advanced systems in production environments.
- Ability to independently manage client engagements from start to finish, delivering actionable insight within established timelines and budget.
- Demonstrated ability to drive business impact through the application of technology and data science, understanding and leveraging the connection between data structures, analytical methods, and business applications.
- Experience working in a consultative capacity (internal or external) and across the sales-to-project closure process.
- Strong analytical thinking skills. Ability to creatively solve business problems, innovating new approaches where required.
- Strong organizational awareness and the ability to work effectively at multiple levels within an organization; equally comfortable discussing technical/analytical details with technical thought leaders and explaining technical subject matter to a non-technical audience, in particular high-level executives.
- Outstanding verbal and written communication skills, excellent project management skills, and experience managing multiple work streams and projects at one time.
- Experience building (and, when necessary, rebuilding) and leading high-performance teams.
- Proven track record of identifying and developing technology talent and leading a team of high-performing personnel.
- Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
- Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
- Experience with AWS cloud services: EC2, EMR, RDS, Redshift.
- Experience with stream-processing systems such as Storm and Spark Streaming (a minimal Spark sketch follows this list).
- 4+ years of experience with object-oriented/functional scripting languages: Python, Java, C++, Scala, etc.
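
As a concrete instance of the stream-processing experience listed above, here is a minimal Spark Structured Streaming sketch: running word counts over a Kafka topic with fault-tolerant checkpointing. The topic, broker, and checkpoint path are hypothetical placeholders.

    import org.apache.spark.sql.SparkSession
    import org.apache.spark.sql.functions._

    object StreamingCounts {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder.appName("StreamingCounts").getOrCreate()
        import spark.implicits._

        // Hypothetical Kafka source of text lines.
        val words = spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "broker:9092")
          .option("subscribe", "lines")
          .load()
          .selectExpr("CAST(value AS STRING) AS line")
          .select(explode(split($"line", "\\s+")).as("word"))

        val counts = words.groupBy($"word").count()

        // The checkpoint location makes the query restartable after failure.
        val query = counts.writeStream
          .outputMode("complete")
          .format("console")
          .option("checkpointLocation", "s3://example-bucket/checkpoints/") // hypothetical
          .start()

        query.awaitTermination()
      }
    }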
Work location is Portland, ME, with required travel to client locations throughout the USA.
Rite Pros is an equal opportunity employer (EOE).
Please Mail Resumes to:
Rite Pros, Inc.
415 Congress St, Suite # 201 & 202
Portland, ME 04101
Email: resumes@ritepros.com