Senior Data Scientist | NC locals only
- Location: Onsite – Raleigh, North Carolina
- Client: Cognizant (Project with LexisNexis)
- Duration: Long-term contract
- Position ID: 47415-1 / 47414-1
- Please send your resume to raghavan@amxsol.com ASAP.
Required Qualifications
- Strong hands-on experience and foundation in machine learning, including dimensionality reduction, clustering, embeddings, and sequence classification.
- Experience with deep learning frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
- Practical experience with NLP libraries and techniques (e.g., spaCy, word2vec, BERT, Keras, Flair).
- Practical experience with LLMs, prompt engineering, fine-tuning, and benchmarking (e.g., LangChain, LlamaIndex).
- Strong proficiency in Python.
- Knowledge of cloud platforms such as AWS, GCP, or Azure.
- Understanding of data modeling and complex data architectures.
- Proficiency in relational databases, NoSQL databases, and vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB).
Job Summary
As a Senior Data Scientist at Cognizant, you will drive new product development within a collaborative team, writing production code in both run-time and build-time environments. You'll propose and build data-driven solutions for high-value customer problems, working with large-scale natural language datasets (matter/contract repositories, legal spend data). The role involves prototyping new ideas and collaborating with technical and legal experts, in a setting that combines a startup's dynamic culture with an established company's resources. The ideal candidate is passionate about moving beyond Jupyter Notebooks and consistently delivers production-ready code.
Key Responsibilities
- Develop and implement LLM-based applications tailored for in-house legal needs.
- Evaluate and maintain data assets and training/evaluation datasets for integrity and quality.
- Design and build pipelines for preprocessing, annotating, and managing legal document datasets.
- Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs.
- Conduct experiments and evaluate model performance for continuous improvements.
- Evaluate AI/ML and GenAI outcomes, through both human and automated review, for accuracy and business alignment.
- Interface with technical personnel to finalize requirements and deliver outcomes.
- Translate complex product requirements into software designs with development teams.
- Implement development processes and coding best practices, and conduct code reviews for production environments.
Preferred Qualifications
- Experience with Scala, Spark, Ray, or other distributed computing frameworks.
- Knowledge of API development, containerization, and machine learning deployment.
- Familiarity with MLOps/AIOps best practices.