Stanislav Fort

Navedeno

	Vse	Od leta 2019
Navedbe	4671	4639
indeks h	21	21
indeks i10	23	23

1900

950

475

1425

201820192020202120222023202414 36 154 336 611 1661 1830

Javni dostop

Prikaži vse

5 člankov

0 člankov

na voljo

ni na voljo

Na podlagi zahtev v povezavi s financiranjem

Soavtorji

Balaji LakshminarayananSenior Staff Research Scientist at Google DeepMindPreverjeni e-poštni naslov na google.com
Surya GanguliAssociate Professor, Stanford UniversityPreverjeni e-poštni naslov na stanford.edu
Clara Huiyi HuGoogle DeepMindPreverjeni e-poštni naslov na google.com
Stanisław JastrzębskiChief Technology Officer & Chief Scientist @ Molecule.OnePreverjeni e-poštni naslov na molecule.one
Jie RenResearch Scientist at Google BrainPreverjeni e-poštni naslov na google.com
Jeremiah Zhe LiuGoogle Research and Harvard UniversityPreverjeni e-poštni naslov na mail.harvard.edu
Dustin TranResearch Scientist, GooglePreverjeni e-poštni naslov na google.com
Daniel M. RoyResearch Director, Vector Institute; Prof., U. Toronto (Statistics, CS)Preverjeni e-poštni naslov na utoronto.ca
Gintare Karolina DziugaiteGoogle DeepMindPreverjeni e-poštni naslov na google.com
Srini NarayananUC Berkeley and GooglePreverjeni e-poštni naslov na icsi.berkeley.edu
Hui Khoon NgAssoc Prof, Yale-NUS College, and Centre for Quantum Technologies, National University of SingaporePreverjeni e-poštni naslov na nus.edu.sg
Yihui QuekMassachusetts Institute of TechnologyPreverjeni e-poštni naslov na mit.edu
Dan WilkinsResearch Scientist, Stanford UniversityPreverjeni e-poštni naslov na stanford.edu
Jared KaplanJohns Hopkins University & AnthropicPreverjeni e-poštni naslov na pha.jhu.edu
Christopher OlahAnthropicPreverjeni e-poštni naslov na google.com

Spremljaj

Stanislav Fort

Google DeepMind

Preverjeni e-poštni naslov na stanford.edu - Domača stran

machine learning artificial intelligence AI safety


Naslov Razvrsti po navedbah Razvrsti po letniku Razvrsti po naslovu	Navedeno Navedeno	Leto
Dawn Drain, Stanislav Fort, Deep Ganguli, Tom Henighan, et al. Training a helpful and harmless assistant with reinforcement learning from human feedback Y Bai, A Jones, K Ndousse, A Askell, A Chen, N DasSarma arXiv preprint arXiv:2204.05862 1, 2022	979*	2022
Constitutional AI: Harmlessness from AI Feedback Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ... arXiv preprint arXiv:2212.08073, 2022	779	2022
Deep Ensembles: A Loss Landscape Perspective S Fort, H Hu, B Lakshminarayanan arXiv preprint arXiv:1912.02757, 2019	620	2019
Exploring the limits of out-of-distribution detection S Fort, J Ren, B Lakshminarayanan Advances in Neural Information Processing Systems 34, 7068-7081, 2021	306	2021
Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned D Ganguli, L Lovitt, J Kernion, A Askell, Y Bai, S Kadavath, B Mann, ... arXiv preprint arXiv:2209.07858, 2022	300	2022
Predictability and surprise in large generative models D Ganguli, D Hernandez, L Lovitt, A Askell, Y Bai, A Chen, T Conerly, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	221	2022
Training independent subnetworks for robust prediction M Havasi, R Jenatton, S Fort, JZ Liu, J Snoek, B Lakshminarayanan, ... arXiv preprint arXiv:2010.06610, 2020	200	2020
Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the neural tangent kernel S Fort, GK Dziugaite, M Paul, S Kharaghani, DM Roy, S Ganguli Advances in Neural Information Processing Systems 33, 5850-5861, 2020	163	2020
A Simple Fix to Mahalanobis Distance for Improving Near-OOD Detection J Ren, S Fort, J Liu, AG Roy, S Padhy, B Lakshminarayanan arXiv preprint arXiv:2106.09022, 2021	160	2021
The Break-Even Point on Optimization Trajectories of Deep Neural Networks S Jastrzebski, M Szymczak, S Fort, D Arpit, J Tabor, K Cho, K Geras arXiv preprint arXiv:2002.09572, 2020	158	2020
Language models (mostly) know what they know S Kadavath, T Conerly, A Askell, T Henighan, D Drain, E Perez, ... arXiv preprint arXiv:2207.05221, 2022	108	2022
Gaussian Prototypical Networks for Few-Shot Learning on Omniglot S Fort arXiv preprint arXiv:1708.02735, 2017	98	2017
Large Scale Structure of Neural Network Loss Landscapes S Fort, S Jastrzebski arXiv preprint arXiv:1906.04724, 2019	84	2019
Stiffness: A new perspective on generalization in neural networks S Fort, PK Nowak, S Jastrzebski, S Narayanan arXiv preprint arXiv:1901.09491, 2019	82	2019
Adaptive quantum state tomography with neural networks Y Quek, S Fort, HK Ng arXiv preprint arXiv:1812.06693, 2018	64	2018
Measuring progress on scalable oversight for large language models SR Bowman, J Hyun, E Perez, E Chen, C Pettit, S Heiner, K Lukošiūtė, ... arXiv preprint arXiv:2211.03540, 2022	58	2022
Discovery of gamma-ray pulsations from the transitional redback PSR J1227-4853 TJ Johnson, PS Ray, J Roy, CC Cheung, AK Harding, HJ Pletsch, S Fort, ... The Astrophysical Journal 806 (1), 91, 2015	58	2015
The goldilocks zone: Towards better understanding of neural network loss landscapes S Fort, A Scherlis Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3574-3581, 2019	44	2019
Emergent properties of the local geometry of neural loss landscapes S Fort, S Ganguli arXiv preprint arXiv:1910.05929, 2019	42	2019
Analyzing monotonic linear interpolation in neural network loss landscapes J Lucas, J Bae, MR Zhang, S Fort, R Zemel, R Grosse arXiv preprint arXiv:2104.11044, 2021	34*	2021

Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.

Članki 1–20

Št. navedb na leto

Podvojene navedbe

Združene navedbe

Dodajanje soavtorjevSoavtorji

Spremljaj

Navedeno

Soavtorji