JD • Experience: Minimum of 10-12 years. • Data Collection and Processing o Gather data from various sources, including databases, APIs, and external datasets. o Clean and preprocess data to ensure quality and consistency. • Data Analysis o Perform exploratory data analysis (EDA) to uncover patterns, trends, and insights. o Use statistical techniques to analyze data and validate findings. • Model Building o Develop predictive models using machine learning algorithms. o Train and fine-tune models to improve accuracy and performance. Skillset • Python • Spark • Efficient data manipulation • Tree based and NN based model training with large data • Experience with OCR, layout model • File parsing with various file types • LLM, RAG • Experience with using REST API