Experience with SQL and relational databases (MySQL, Oracle, PostgreSQL).
Experience with big data technologies and non-relational databases (Hive, Hadoop, HDFS, MongoDB).
Process automation and scripting (Python, Airflow, Autosys).
Data analysis using pandas and regular expressions.
Linux and command-line interface experience.
SQL and Spark with Apache Hive for querying and manipulating data.
Mining, preparing, cleansing, and analyzing structured data.
Experience in data discovery and identification, data lifecycle management, data classification, data modeling, and privacy policies.