{"id":520,"date":"2025-12-28T03:39:53","date_gmt":"2025-12-28T03:39:53","guid":{"rendered":"https:\/\/maxpepsoft.com\/?post_type=jobpost&#038;p=520"},"modified":"2026-04-06T20:57:49","modified_gmt":"2026-04-06T20:57:49","slug":"data-engineer","status":"publish","type":"jobpost","link":"https:\/\/maxpepsoft.com\/index.php\/jobs\/data-engineer\/","title":{"rendered":"Data Engineer"},"content":{"rendered":"<p><span style=\"text-decoration: underline;\"><strong>Posting Date:<\/strong><\/span><\/p>\n<p><strong>04\/06\/2026<\/strong><\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p><span style=\"text-decoration: underline;\"><strong>POSITION:<\/strong><\/span><\/p>\n<p>Data Engineer<\/p>\n<p>Full-time job, 40 hours per week<\/p>\n<p>Pay\/Salary: $135,533.00 year.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline;\"><strong>NUMBER OF \u00a0OPENINGS:<\/strong><\/span> 1<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline;\"><strong>LOCATION:<\/strong><\/span><\/p>\n<p>MAXPEP LLC<\/p>\n<p>1071 Cambridge Square, Suite B<\/p>\n<p>Alpharetta, GA 30009<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline;\"><strong>JOB DUTIES:<\/strong><\/span><\/p>\n<ul>\n<li>Load data into Spark RDD, perform advanced procedures like text analytics and processing using in memory data Computation capabilities of Spark.<\/li>\n<li>Leverage Apache Spark on Google Cloud Dataproc, orchestrating Spark jobs written in scala for high-throughput batch processing of transactional and behavioral data.<\/li>\n<li>Build performance-optimize python-based data connectors to ingest external vendors feed directly into GCP storage solutions such as Cloud Storage, triggering downstream Spark jobs.<\/li>\n<li>Develop Python scripts using both Data frames\/SQL and RDD in Spark for Data Aggregation. Develop spark program using Scala APIs for data processing.<\/li>\n<li>Involve in creating Hive tables, loading data and writing Hive queries which will run internally in map reduce.<\/li>\n<li>Implement Partitioning, Dynamic Partitions, Buckets in HIVE.<\/li>\n<li>Develop the code for Importing and exporting data into HDFS and Hive using Sqoop. Implement POC to migrate map reduce jobs into Spark RDD transformations using Scala\/Python.<\/li>\n<li>Develop data pipelines to ingest, process, transform, and load data from various sources into GCP storage and analytics services such as BigQuery, Cloud Storage, and Dataflow.<\/li>\n<li>Work under supervision.<\/li>\n<li>Travel and\/or Relocation to various unanticipated client sites throughout USA is required.<\/li>\n<\/ul>\n<p><span style=\"text-decoration: underline;\"><strong>EDUCATION:<\/strong><\/span><\/p>\n<p>Master\u2019s\u00a0 degree in Computer Science\/Information Technology\/ IS\/Engineering (any)\/Business Analytics or closely related field with Six (6) months of experience in the job offered or as a Data Engineer or IT Consultant or Analyst or Programmer or Developer or Engineer or closely related field.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline;\"><strong>EXPERIENCE:<\/strong><\/span><\/p>\n<ul>\n<li>Experience of Six (6) months working with Python and Scala is required.<\/li>\n<li>Travel and\/or relocation is required to unanticipated client sites within USA.<\/li>\n<li>International travel is not required.<\/li>\n<li>The frequency of travel is currently not known as it depends on the client and project requirement that cannot be currently anticipated.<\/li>\n<li>Employer provides Information technology services to various clients in USA and hence implementing projects will require such travel.<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Posting Date: 04\/06\/2026 \u00a0 POSITION: Data Engineer Full-time job, 40 hours per week Pay\/Salary: $135,533.00 year. &nbsp; NUMBER OF \u00a0OPENINGS: 1 &nbsp; LOCATION: MAXPEP LLC 1071 Cambridge Square, Suite B Alpharetta, GA 30009 &nbsp; JOB DUTIES: Load data into Spark RDD, perform advanced procedures like text analytics and processing using in memory data Computation capabilities [&hellip;]<\/p>\n","protected":false},"author":12,"featured_media":0,"menu_order":0,"template":"","jobpost_category":[],"jobpost_job_type":[],"jobpost_location":[],"jobpost_tag":[],"class_list":["post-520","jobpost","type-jobpost","status-publish","hentry","entry"],"_links":{"self":[{"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/jobpost\/520","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/jobpost"}],"about":[{"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/types\/jobpost"}],"author":[{"embeddable":true,"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/users\/12"}],"wp:attachment":[{"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/media?parent=520"}],"wp:term":[{"taxonomy":"jobpost_category","embeddable":true,"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/jobpost_category?post=520"},{"taxonomy":"jobpost_job_type","embeddable":true,"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/jobpost_job_type?post=520"},{"taxonomy":"jobpost_location","embeddable":true,"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/jobpost_location?post=520"},{"taxonomy":"jobpost_tag","embeddable":true,"href":"https:\/\/maxpepsoft.com\/index.php\/wp-json\/wp\/v2\/jobpost_tag?post=520"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}