Machine Learning Research Scientist

Neural Magic

Neural Magic

Software Engineering
Somerville, MA, USA
Posted on Thursday, October 5, 2023

Neural Magic is an early-stage AI software company democratizing high performance for deep learning models. Our goal is to reduce the cost and increase the performance of end-users deploying deep learning applications. Based on decades of research at MIT, Neural Magic has developed a software platform that allows developers to sparsify deep learning models to minimize footprint and run on CPUs at GPU speeds. Please look through our website and GitHub repos to get a feel of what we are about.

Founded by an award-winning team of computer scientists and researchers out of MIT, we are a venture-backed company headquartered in Davis Square, Somerville, MA. Our investors include Amdocs, Andreessen Horowitz, Comcast Ventures, NEA, and Pillar VC.

We are seeking a machine learning research scientist with a proven publication history related to model compression techniques such as pruning and quantization. This person will work closely with our research team to identify, report on, and create new algorithms within the deep learning field. If you are someone who wants to contribute to solving challenging technical problems at the forefront of deep learning, this is the role for you!


  • Use your deep understanding of machine learning to tackle meaningful technical problems
  • Collaborate with product development teams to transfer your ideas into product solutions
  • Perform fundamental research by defining, designing, implementing, and evaluating algorithms
  • Actively engage with the academic community by collaborating with universities, publishing and presenting your work, and attending conferences
  • Proven experience as a machine learning researcher with research publications in the model compression (quantization/pruning) or generative AI / NLG / LLMs space
  • Strong background in deep learning with expertise in one or more of computer vision, NLP, speech, reinforcement learning, generative models, etc
  • Knowledge of common ML frameworks (like PyTorch or Keras) and libraries (like NumPy and scikit-learn)
  • Experience with creating/prototyping algorithms in a popular ML framework such as PyTorch, TensorFlow, jax, etc
  • Strong programming skills with proven experience prototyping and delivering advanced algorithmic solutions
  • Ability to explain and present analysis and machine learning concepts to a broad technical audience
  • Creative, collaborative, and innovation-focused
  • Strong sense of project ownership and personal responsibility
  • Ph.D. in Computer Science, Mathematics or similar field
  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Training & Development
  • Work From Home
  • Free Food & Snacks
  • Wellness Resources
  • Stock Option Plan
We are an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.