Career Area: Business Technologies, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Caterpillar Inc. seeks Lead Software Engineer at its facility located at 540 W Madison, Chicago, IL 60661.
Focus on data quality evaluation of large set of diverse data from IoT connected assets, our integrated network of dealers, and enterprise data. Contribute to the definition and implementation of quality metrics, identification of data quality rules and evaluation of their impact, as well as root cause analysis of data quality problems. Use analytics and visualization methods to solve problems for Caterpillar internal customers. Have prior experience in a business intelligence or quality role, be proficient in SQL, have development experience in Python and dashboard design. Design, develop, and maintain Dealer and Enterprise quality dashboards and reports. Provide analytics support to high profile Helios Data Division Projects. Use analytics methods to make recommendations to Designers, Product Owners, and Managers. Work independently without close supervision on medium to high complexity projects. Perform all necessary data related tasks including data design, data quality, metadata management, data governance or data architecture. Act as team lead by providing technical leadership, guidance, and assistance to other team members. Work directly on complex problems and provide solutions aligned with overall objectives of Cat Digital. Understand and evaluate business requirements that impact the Caterpillar enterprise, dealership partners, and other internal and external customers. Liaise with data creators to support project planning, training, guidance on standards, and the efficient creation/maintenance of high-quality data. Contribute to policies, procedures, and standards as well as technical requirements. Create the data management strategy and define continuous improvement goals of data quality, governance, and processes. Lead communication across various lines of businesses and technology partners to constantly monitor, collect, review, and roadmap business data requirements, and related technology solutions. Ensure compliance with the latest data standards supported by the company, and brand, legal, information security (data security and privacy compliance). Maintain knowledge of functional technology of the data management systems as they change with upgrades and new releases. Maintain existing data quality scorecard measurements, execution of scorecards, resolving exceptions with data owners and source systems. Design robust, scalable database and data extraction, transformation, and loading (ETL) solutions. Document data models for domains to be deployed including a logical data model, candidate source lists, and canonical formats. Develop data ingestion store specifications identifying source systems and the tables and fields to ingest. Develop data object pipeline specifications including detailed field level mapping, business rules, data quality rules, etc. Create, update, and enhance metadata policies, processes, and catalogs. Drive alignment and coordination of data governance efforts.
Minimum Requirements:
This position requires a Bachelor’s degree, or foreign equivalent, in Information Technology, Computer Science, or a related field plus Eight (8) Years of experience as a Software Developer or related occupation. Alternatively, the employer would accept a Master’s degree, or foreign equivalent, in Information Technology, Computer Science, or a related field, plus Four (4) Years of experience as a Software Developer or related occupation. Additionally, the applicant must have employment experience with:
- Compiling and standardizing diverse, non-sanitized datasets;
- Working with structured and unstructured data; Snowflake, Exadata, Netezza, SAP, Relational Databases, Stored Procedures, Views, Indexing, Window Functions, Normalization;
- Integrating analytical models with existing data pipelines; Linear Regression, Classification, Logistic Regression, Decision Trees, Random Forests, SVM, Hypothesis Testing, K-NN, K-Means Clustering, Text Analysis, Sentiment Analysis, Time Series Analysis, Survival Analysis, A/B Testing, Numpy, Matplotlib, Sckit-learn, Pandas, Scipy;
- Statistical approaches, quantitative analytic methods, data management techniques, and/or related digital technologies, and the ability to handle complex issues;
- AWS full-stack development and services such as: Athena, Glue, DynamoDB, EC2, EMR, RDS, S3, Sage Maker;
- Snowflake data warehouse;
- Visualizing data using BI software such as Tableau and MS Power BI;
- Designing and implementing data processing and machine learning frameworks.
Applicants who are interested in this position should apply via www.caterpillar.com/careers, search [Lead Software Engineer / Reference # R0000270064 ] # LI-DNI.
Posting Dates: August 13, 2024 - December 21, 2024
Any offer of employment is conditioned upon the successful completion of a drug screen.
EEO/AA Employer. All qualified individuals - Including minorities, females, veterans and individuals with disabilities - are encouraged to apply.
Not ready to apply? Join our Talent Community.