Python, R, SAS, Hadoop, Machine Learning, SQL, Natural Language Processing (NLP), NLP tools (such as NLTK), Apache Lucene, Solr, Graph analytics tools: GraphX, Apache Giraph, Deep Learning tools such as Caffe, Torch
Located in Chicago, IL we are searching for a Lead Data Scientist with NPL, NLTK, Apache Lucene, Solr, Data sets experience with their Master's Degree to add to our growing team. We are the leading company in our industry and we need someone to work with data-sets of varying degrees of size and complexity including both structured and unstructured data. Are you local to Chicago? Are you looking to make an impact on a growing team for an industry leading company? Please apply today!
Top Reasons to Work with Us
1.) Full paid benefits
2.) Leading company in the healthcare industry
What You Will Be Doing
Primary duties may include are but not limited to: Design machine learning projects to address specific business problems determined by consultation with business partners. Piping and processing massive data-streams in distributed computing environments such as Hadoop to facilitate analysis. Implements batch and real-time model scoring to drive actions. Develops proprietary machine learning algorithms to build customized solutions that go beyond standard industry tools and lead to innovative solutions. Develop sophisticated visualization of analysis output for business users. Publish results and address constraints/limitations with business partners. Provides high-level controllership/evaluation of all output produced to ensure established targets are met. Determines the continuous improvement opportunities of current predictive modeling algorithms.
What You Need for this Position
*Advanced expertise with software such as Python, R, SAS. Programming experience in Python is strongly preferred.
*Experience working with distributed computing environment such as Hadoop.
*Intermediate to Advanced knowledge of using with Hive, Impala or Apache Spark.
*Intermediate to Advanced knowledge of data extraction and manipulation using SQL.
* Master's or PhD in Statistics, Computer Science, Mathematics, Machine Learning, Econometrics, Physics, Biostatistics or related
* 2+ year's experience utilizing NLP applications such as topic models and sentiment analysis to identify patterns within data sets is strongly preferred.
* Experience using open source NLP tools (such as NLTK), Apache Lucene, Solr, etc. is preferred.
* 3-5 years in Predictive Analytics
Located in Chicago, IL we are searching for a Lead Data Scientist with NPL, NLTK, Apache Lucene, Solr, Data sets experience with their Master's Degree to add to our growing team. We are the leading company in our industry and we need someone to work with data-sets of varying degrees of size and complexity including both structured and unstructured data. Are you local to Chicago? Are you looking to make an impact on a growing team for an industry leading company? Please apply today!
Top Reasons to Work with Us
1.) Full paid benefits
2.) Leading company in the healthcare industry
What You Will Be Doing
Primary duties may include are but not limited to: Design machine learning projects to address specific business problems determined by consultation with business partners. Piping and processing massive data-streams in distributed computing environments such as Hadoop to facilitate analysis. Implements batch and real-time model scoring to drive actions. Develops proprietary machine learning algorithms to build customized solutions that go beyond standard industry tools and lead to innovative solutions. Develop sophisticated visualization of analysis output for business users. Publish results and address constraints/limitations with business partners. Provides high-level controllership/evaluation of all output produced to ensure established targets are met. Determines the continuous improvement opportunities of current predictive modeling algorithms.
What You Need for this Position
*Advanced expertise with software such as Python, R, SAS. Programming experience in Python is strongly preferred.
*Experience working with distributed computing environment such as Hadoop.
*Intermediate to Advanced knowledge of using with Hive, Impala or Apache Spark.
*Intermediate to Advanced knowledge of data extraction and manipulation using SQL.
* Master's or PhD in Statistics, Computer Science, Mathematics, Machine Learning, Econometrics, Physics, Biostatistics or related
* 2+ year's experience utilizing NLP applications such as topic models and sentiment analysis to identify patterns within data sets is strongly preferred.
* Experience using open source NLP tools (such as NLTK), Apache Lucene, Solr, etc. is preferred.
* 3-5 years in Predictive Analytics