I am an AI safety researcher currently investigating LLM deception at MATS. My previous work has involved improving preference modeling for scalable oversight and formally proving expressivity relationships between different RL formalisms.
-
Member Of Technical StaffOpenaiSweden -
Research ScholarMl Alignment & Theory Scholars Sep 2024 - PresentLondon Area, United Kingdom(MATS extension program) Continuing my work on targeted LLM deception and manipulation. Paper: https://arxiv.org/abs/2411.02306 -
Research ScholarMl Alignment & Theory Scholars Jun 2024 - Sep 2024Berkeley, California, United StatesWrote the paper "Targeted Manipulation and Deception Emerge when Optimizing LLMs for User Feedback" which looks at how deceptive and manipulative LLM behavior can arise from imperfect feedback, particularly user feedback. -
Ai Alignment Researcher (Ltff Grant)Independent Nov 2023 - Jun 2024Long-Term Future Fund funded project which showed switching to a multi-objective reward function in the preference model of a RL from AI feedback system improves performance. Paper: https://arxiv.org/abs/2406.07295
-
Course FacilitatorAi Safety Fundamentals Aug 2023 - Dec 2023Lund, Skåne County, SwedenFacilitating multiple in person groups for the AI Safety Fundamentals Alignment course -
ResearcherAi Safety Hub Jul 2023 - Oct 2023Oxford, England, United KingdomWorking on comparing the expressivities of different Reinforcement Learning formalisms. Traditional Markov rewards cannot express objectives like risk aversion, max-min or lexicographic priorities which could be important for training safe agents. This work investigates other formalisms such as Multi-Objective RL, Reward Machines, Linear Temporal Logic and Convex RL among many others, to see which of these can express all possible tasks expressible by another formalism. -
Masters Thesis - Detecting Bone Marrow Particles In Blood Using Vision Transformers And CnnsCellavision Jan 2023 - Jun 2023Lund, Skåne County, SwedenI worked on creating a system to detect, classify and segment bone marrow fragments in blood. This included automating the focusing and image acquisition process from bone marrow aspirate smears and training several vision transformers and CNNs. -
Software Developer In Mobile ApplicationsAxis Communications 2019 - 2021Lund, Skåne County, SwedenI developed a program that visualized how users interact with Axis' mobile applications, enhancing our understanding of user behavior and leading to improvements in the overall user experience. I also analyzed error spikes and abnormal start-up times or latencies, contributing to the improvement of our mobile applications. I gained experience working with SQL databases, BigQuery, and Looker Studio.
Marcus Williams Education Details
-
5.0/5.0 (Perfect Grades In Every Course)
Frequently Asked Questions about Marcus Williams
What company does Marcus Williams work for?
Marcus Williams works for Openai
What is Marcus Williams's role at the current company?
Marcus Williams's current role is Member of Technical Staff.
What schools did Marcus Williams attend?
Marcus Williams attended The Faculty Of Engineering At Lund University.
Who are Marcus Williams's colleagues?
Marcus Williams's colleagues are Jose Deivid, Hossem Ben Ayed, Daisy Yang, Mikhail P., Hitoshi Kawaharada, Vladimir Petrov, Harsh Koshta.
Not the Marcus Williams you were looking for?
-
-
Marcus Williams
Cheltenham2craneaerospace.com, mirus-as.com -
Marcus Williams
Global Product Manager / Specialist In Clinical Diagnostics With A Successful Record Of Delivering Large Scale Reproductive Health And Mass Spectrometry Projects.United Kingdom1perkinelmer.com -
Marcus Williams
Greater London -
Marcus Williams
Warrington
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial