Data Scientist


This is a full-time position in New York, NY, posted August 29, 2021.

Millennium is a top-tier global hedge fund with a strong commitment to leveraging innovations in technology and data science to solve complex problems for the business.

We are expanding our data science practice to take advantage of new tools and techniques in the areas of large-scale data analytics and Machine Learning.

We are looking to hire a Data Scientist who will develop large-scale data analytic capabilities to extract insights from structured and unstructured data.

The role will focus on conducting R&D on data-driven solutions for business problems.

The candidate will be expected to manage all aspects of data science projects, from problem formulation to working with developers to productionize solutions.

A strong background in Machine Learning is crucial for the role, along with significant experience in building models in practice.

Experience in text analytics is required; experience with unstructured data beyond text (images, audio, etc.) is a plus.

Beyond text mining, experience with time series, anomaly detection or graph mining would also be a plus.

Principal Responsibilities
Develop innovative solutions for large-scale data sets to generate new business insights
Determine the best tools and techniques available
Extract data from different data stores (SQL and NoSQL databases, S3, etc.)
Pre-process data for consumption by Machine Learning models
Perform feature engineering and feature selection
Run evaluations comparing multiple modeling techniques
Develop prototypes of tools
Work with production teams to deploy models
Build relationships within the software engineering and infrastructure teams to develop strategies for supporting and scaling the firm's analytics capabilities

Qualifications/Skills Required
Graduate degree specializing in Machine Learning, Data Science or related fields (Natural Language Processing, Information Retrieval, Statistics, Operations Research)
1-3 years of relevant work experience
Expert knowledge of large-scale data analytics and its applications
Experience with Elasticsearch, AWS services (RDS, S3, EMR, Athena, etc.) and data-processing frameworks such as Spark and Flink is a plus
Solid written and verbal communication skills
Self-starter able to execute independently with light supervision