Hi, I'mAjith Annavarapu

    Data Engineer & Data Scientist

    I transform raw data into meaningful insights that drive business decisions. With expertise in building robust data pipelines and scalable data systems, I help organizations harness the full potential of their data.

    I'm always open to new challenges and collaborations. If you're working on an interesting project that requires data expertise, A.I or even Vibe Coding. I'd love to hear about it!

    My Journey

    From curious data enthusiast to professional data engineer.

    My journey in the data field began with a fascination for extracting insights from raw information. Starting with a solid foundation in computer science, I quickly developed expertise in data engineering and analytics, working across various domains and technologies.

    Along this path, I've continually expanded my skills, adapting to emerging technologies and methodologies, and building a comprehensive toolkit that allows me to tackle complex data challenges with confidence and creativity.

    Skills & Expertise

    A comprehensive toolkit built through years of experience and continuous learning.

    Languages & Databases

    Python
    SQL (Postgres, MS-SQL, MySQL)
    Java
    C
    React
    Cassandra

    Cloud & DevOps

    AWS (Glue, Lambda, S3, EC2, EMR, ECR)
    API Gateway
    KMS
    SageMaker
    CloudWatch
    IAM
    DMS
    Redshift
    CI/CD

    Big Data & ETL

    Snowflake
    Databricks
    PySpark
    Informatica
    Delta Lake
    Snowpipe
    Unity Catalog
    Z-Ordering
    Auto-Optimize

    Artificial Intelligence

    Large Language Models (LLMs)
    Generative AI
    Prompt Engineering
    RAG (Retrieval Augmented Generation)
    Neural Networks
    NLP & Text Processing
    Vector Databases
    AI Model Deployment

    Machine Learning Algorithms

    Decision Trees
    K-means
    SVM
    Random Forest
    Linear Regression
    XGBoost
    TensorFlow
    PyTorch
    Keras

    Data Analysis & Visualization

    Tableau
    Matplotlib
    Plotly
    Seaborn
    Excel
    A/B Testing
    QuickSight
    PowerBI

    Data Science Libraries

    Scikit-learn
    NumPy
    Pandas
    SciPy
    Langchain
    Streamlit

    Workflow Orchestration

    Jenkins
    n8n
    Hadoop
    Apache Airflow
    Docker

    Version Control

    Git
    AWS Cloud Source Repositories

    Work Experience

    My professional journey and the impact I've made.

    Data Engineer

    Accenture
    Apr 2024–Present

    Led migration of ETL workflows to Databricks, developed data governance frameworks, and implemented AI-driven monitoring systems. Created data optimization strategies in Snowflake and integrated LLMs for metadata tagging and automated reporting, significantly improving processing time, reducing costs, and enhancing data quality and accessibility.

    Data Engineer Associate

    HomeOMattic Service Pvt Ltd
    Apr 2021–Jan 2022

    Designed high-performance ETL pipelines and event-driven architectures that improved data refresh cycles and reduced ingestion delays. Implemented comprehensive workflow orchestration with Apache Airflow and deployed AI-based anomaly detection systems, ensuring high availability and automated monitoring for data pipelines serving thousands of daily users.

    Programme Analyst Trainee

    Cognizant Technology Solutions
    July 2020–Apr 2021

    Built large-scale PySpark pipelines for customer data aggregation and developed predictive models for customer churn analysis. Integrated AWS services to optimize cloud storage costs and created NLP sentiment analysis pipelines, enhancing marketing personalization and analytics capabilities.

    Intern

    Cognizant Technology Solutions
    Jan 2020–May 2020

    Designed Tableau dashboards for compliance metrics and developed data integration workflows with robust validation. Automated metadata cataloging processes and optimized Snowflake query performance, significantly improving reporting efficiency and enabling real-time monitoring.

    Intern

    HomeOMattic Service Pvt Ltd
    Jan 2019–Jan 2020

    Developed machine learning classifiers for fraud detection and built unsupervised anomaly detection algorithms that reduced false positives and manual audits. Created automated model training workflows and visualization dashboards, improving operational efficiency and data accuracy for risk analysis.

    Certifications

    Professional certifications and achievements.

    AWS Data Engineer Associate

    Amazon Web Services

    Sept 2024

    Apache Cassandra 3 Developer

    DataStax

    April 2025

    Databricks Generative AI Fundamentals

    Databricks

    Jan 2025

    Featured Projects

    A selection of my work that showcases my skills and experience.

    About Me

    Beyond the code and data, here's a bit more about who I am.

    Ajith Annavarapu

    Ajith Annavarapu

    I'm a passionate Data Engineer with a Master's degree in Data Science, dedicated to building robust and scalable data systems that transform raw data into valuable insights.

    When I'm not immersed in data, you'll find me exploring new hiking trails, Vibe Coding, experimenting with cooking recipes, or diving into a good thriller movie. I believe in continuous learning and staying curious about the world around us.

    I'm always open to new challenges and collaborations. If you're working on an interesting project that requires data expertise, I'd love to hear about it!

    Location:
    Irving, TX
    Education:
    M.S. in Data Science

    Get In Touch

    Interested in working together? Feel free to reach out!

    Get In Touch

    I'm currently open to freelance opportunities, consulting work, and full-time positions. Don't hesitate to reach out if you think we could work together!