1
Name | Discord Username on OpenBioML | Still interested in contributing | Short intro | Skills | Active contribution or observer? | Tasks I am working on or could contribute to (the list of all tasks and active tasks is available here: https://github.com/orgs/pinellolab/projects/3/views/1) | GitHub username | Added
2
Luca Pinello | Luca Pinello | yes | I am an Associate Professor at Harvard Medical School and MGH in Boston, USA | Gene regulation, genomics data analyses, genome editing, single-cell omics, machine learning, visualization, web development | Very active! I started the project and I am planning to spend substantial time on it | 1) Define internal metrics to evaluate the quality of the produced sequences 2) Curate datasets that can be used to define conditioning 3) Define plan for experimental validation | lucapinello | x
3
Lucas Ferreira | Lucas Ferreira | yes | I'm a Postdoc Research Fellow at the Luca Pinello lab (Harvard Medical School and MGH, Boston, USA) | Bioinformatics, Gene regulation, Epigenetics, single-cell omics, DL, ML | Very active, I prototyped and adapted the initial DNA bitdiffusion model. Currently implementing model evaluation metrics and adapting conditional generation | 1) Prototyping the first model; exploring architectures; 2) Creating and testing internal metrics (e.g. motif distance, FID); 3) Developing and implementing new conditioning variables (e.g. motifs); 4) Benchmarking against other models; 5) Define internal metrics to evaluate the quality of the produced sequences | LucasSilvaFerreira | x
4
Wouter Meuleman | meuleman | yes | Investigator at Altius Institute for Biomedical Sciences and Affiliate Associate Professor at the University of Washington, Seattle WA U.S.A. | Genome organization, gene regulation, data visualization, large-scale genomic data analyses, machine learning | Very active, provided initial datasets and annotations, co-drafted proposal. This work is essentially a continuation of work in my own group | 1) Define the base dataset to use for building predictive and generative models. 2) Curate these datasets into higher-confidence subsets to decrease label noise. 3) Provide guidance and education on the source and experimental assays related to the datasets, the Biological background as it relates to gene regulation, and assumptions regarding behavior/distribution of the data. 4) Evaluate models and their results in the context of existing work, and in silico vs. in vitro experimental validation assays. 5) Formulate testable hypotheses and their evaluations. 6) Capture insights and conclusions into widely accessible scientific writing. | meuleman | x
5
Zach Nussbaum | nussy | yes | ML Engineer @ Deep Genomics, Previously at Amazon | Machine Learning, Deep Learning, NLP/Transformers, DL applied to Genomics | Very active, helping augment the current training code to allow for conditioning | Adding classifier-free conditioning for training, augmenting UNet code + verifying implementation is correct, training on motif dataset + cell type | zanussbaum | x
6
Niccolo' Zanichelli | Niccolo' Zanichelli
7
César Miguel Valdez Córdova | César Miguel (semibah#0612) | yes | Master's student @ JKU Linz | DL, ML, Life sciences data (protein/genomics), Causality, Generative Models, Scientific Writing | Very active. I want to see the project get completed and will contribute substantially, timewise, to it. | Benchmarking, Model exploration, extension and interpretation. General project writing, communicating. | cmvcordova | x
8
Matei Bejan | Matei Bejan (Vrööm) | yes | PhD Student in AI, Deep Learning Research Scientist | ML, DL, PyTorch, Attention, Diffusion. | Very active | VQ-VAE model training, UNet refactoring & optimization. Can offer guidance on other tasks (e.g. sampling, scalability). If there are enough people involved and I have the time, I can get involved in these "other tasks" as well. | mateibejan1 | x
9
Sameer Gabbita | sameer1 | yes | AI student researcher @ MGH | ML/DL, PyTorch, Generative Modeling | Active - currently developing autoencoder models to potentially diffuse in latent space | Training a VQ-VAE for DNA-sequences for stable diffusion; Core model, benchmarking, model exploration | sg134 | x
10
Aaron Wenteler | aaron_w | yes | PhD student @ Queen Mary University of London | ML, DL, Bioinformatics, Genomics, Drug Discovery, Interpretability | Very interested in actively contributing wherever I can. | 1) Currently thinking about downstream applications of generative DNA models for drug discovery. Have some ideas about this which I'll put in proper writing. 2) Benchmarking and evaluation 3) Optimizing and refactoring UNET (want to be involved with model engineering and training, but more from the sidelines to learn and help out where I can) | aaronwtr | x
11
Hassan Ahmed | | | Data Scientist/ML Engineer at EoN Next | ML engineering, DL, Experimentation/Experiment design | Very interested in actively contributing wherever I can | Implement a sequence quality metric based on k-mer composition | hssn-20
12
Mihir Tripathy | Mihir | yes | Computer Science Undergrad, DL Researcher in Computational Biology and SLAM. | ML, DL, Pytorch, Tensorflow, Keras, Computer Vision and Data Engineering | Actively contributing. | Refactoring and optimizing the UNET diffusion code | mihirneal | x
13
Moughees Ahmed | ghees | yes | Engineer at a startup in Toronto, research in comp bio/genomics @ Emory, bioinformatics, software engineer, NLP and speech research | BioML, anything software | Active | Data building, modelling | codeghees | x
14
Ihab Bendidi | Ihab | yes | PhD candidate at ENS-PSL Paris | Deep Learning, bioml, self supervised learning | active contributor | Distributed training on pytorch lightning | IhabBendidi | x
15
Simon Senan | ssenan | yes | MSc Data Science | ML, DL, Computer vision relating to medical imaging | Active contributor where needed | Refactoring and optimizing the UNET diffusion code, implementing dataloaders | ssenan | x
16
Saurav Maheshkar | psych0man#4316 | yes | Machine Learning Engineer, Open Source Contributor, Ex: @Weights & Biases | Deep Learning (Google Developer Expert in ML), Natural Language Processing, Computer Vision | Would love to be an active contributor | Refactoring the UNET bit diffusion code, Benchmarking against other tools (GAN paper, Transformer implementation), Explore other architectures for denoising | SauravMaheshkar | x
17
Jan Sobotka | jan_s | yes | Bachelor student of Computer Science @ CTU Prague, ML Engineer @ Generali | ML, DL, reinforcement learning, computer vision, full-stack web development, competitive programming | Would love to actively contribute wherever needed! | 0) Currently exploring the notebook and data; 1) Training a VQ-VAE for DNA-sequences for stable diffusion; 2) Explore other architectures for denoising; 3) Benchmarking against other tools | Johnny1188 | x
18
Ethan Cohen | ethan_cohen | Yes | PhD student @ ENS Paris | ML, DL, Computer vision, DL applied to Bioimages | Would love to actively contribute wherever needed and as much as I can | | ethancohen123 | x
19
Abraham Owodunni | | Yes | Researcher at Sisonke Biotik | DL, ML, and a bit of XAI | Really want to be active | to be decided | owos
20
Jon Chen | Jon Chen | yes | I'm a postdoc in the Collins lab at MIT/Broad studying RNA-small molecule interactions, PhD with David Liu where I built VAEs for generating DNA sequences with small molecule binding capabilities | ML, DL, Pytorch, oligonucleotide folding and binding, VAEs for sequence generation | Contribution | Create function motif_count Input, general model development | joncchen
21
Shashank Yadav | xinfy | yes | 1st year PhD student in Biomedical Engineering at UArizona. | ML, DL | Observer and Learner | | xinformatics
22
Martino Mansoldo | mm65 | yes | Machine Learning Engineer, Master in AI @ U of Edinburgh | Python, PyTorch, Deep Learning, Transformers, GNNs, software engineering | Would like to contribute | Getting up to speed | mansoldm | x
23
Dev Vidhani | DevV | | Full time ML Practitioner in industry for last 4 years - originally focussed on NLP; Currently learning theories to understand DL | ML, DL, NLP, Multimodal (Vision, NLP); Diffusion models, VAE models; Optimizers, Loss landscape, bayesian ML; Theoretical understanding of DL; PyTorch | Active | | devvidhani | x
24
Patrick Bryant | | | Mixed background, PhD in protein structure prediction, Postdoc in Berlin | Various biological problems, Deep Learning; https://patrickbryant1.github.io/research.html | Active | | patrickbryant1 | x
25
PeePa | | | 1 year experience as a Cloud Engineer, new to ML. Given a failed attempt to recreate brainport in uni. New to learn ML, interest in learning epigenetics with technology that has been used to create brainport (hardware side). | Terraform, Python | Observing, really want to be active, and mostly wanting to learn | Tasks I am working on: Creating a dataloader and unit test. End goals: Learn how this model is built, and get a headstart in creating epigenetics projects | JhonSummer | x
26
Jae Shin | | | PhD Student, Computational biology | bioinformatics | Would like to contribute | | jaewshin | x
27
Marla | ceznaka301#0779 | | Machine Learning Engineer, Master in CS @ TUM | Machine Learning, Deep Learning, Medical Imaging, PyTorch | Interested in actively contributing but still trying to understand many things around the project | | nagam11 | x
28
Stuti | kingstut | | ML engineer contractor | ML | Would like to contribute | Embedding so that stable diffusion can be used for this project | kingstut | x
29
Afshan | afshan22 | | ML engineer | ML, bioinformatics, bio sequence models | Interested in active contribution | Getting up to speed with project so far. Taking notes. | nabiafshan | x
30
Omar Ayyub | tent | | PhD Bioengineering, Biochemical Genetic Disorders, Diagnostics | Genetics, python, pytorch, web development | Will observe for now until I can contribute in a meaningful manner. | | obayyub | x
31
Giulio Tosato | artificial.giulio | | bachelor student in artificial intelligence & cognitive science, university background in math | Pytorch, Keras, Medical imaging (nn-unet), brain computer interface, fast learning and creative mind | actively contributing wherever my help could be useful | getting familiar with all this | artificialgiulio | x
32
Dashiell Stander | | | ML Scientist @ Eleuther AI & Stability | Jax & Pytorch, ML Engineering, Diffusion & Score Matching | Interested in actively contributing.
33
Michael Pieler | | | OpenBioML and Eleuther AI | DL, multimodal CLIP setup, contrastive learning, biotech | | multi-GPU training, HPC setup, CLIP setups (if needed)
34
Mihai Todor | | | Software Engineer / Tech Lead working on Data Streaming, Scalability and Open Source software | No specific background in the field, trying to learn and looking to work on computational biology in the future (epigenomics, immunotherapy for cancer, ageing). I spent many years working with Go, C++, all things Kubernetes etc. Details here: https://www.linkedin.com/in/mtodor | Mostly observing, would be curious to hear if there's any interest to collaborate on training models and running them at scale
35
Harsha | | | Master's in CS @ IIITH | ML, DL, Knowledge graphs, Data Driven Drug Discovery | Observing right now, would love to contribute once I am free
36
David Laub | | | PhD student @ UC San Diego | ML/DL for genomics | Mostly observing
37
Arsenii Zinkevich | | | PhD student, Bioinformatics, Drug Target Discovery | ML, Bioinformatics, Genomics, Data research, Docker
38
Daniel Maturana | | | PhD in Biochemistry, experience annotating genomes and regulatory in bacteria. Working with NanoTemper | DL learner, python programming, genomic, proteins and interactions studies | Mostly observing, but interested in giving feedback, learning and collaborating as much as possible
39
Elizaveta Noskova | | | Master's student, bioinformatician | Machine learning, deep learning, Pytorch
40
Matt Fisher | | | Founding Engineer @ Vizcom.ai - Generative 3D AI; Eleuther contributor; CRISPR Cas9 Hobbyist | PyTorch script writing; training | Active; Genome Auto-encoder (?)
41
Rushikesh Zawar | | | Research Engineer in Computer Vision (Graduated with B.E. Computer Science & Masters in Biological Sciences) | ML/DL, Computer Vision, Reinforcement Learning, Biological Sciences (wet-lab in gen-tech, RDNA and bioinformatics) | Observing right now. Would love to contribute, but after a few weeks, because of some other commitments
42
James Hennessy | | | | ML, Bioinformatics, Chemoinformatics, Computer vision, NLP, Data Engineering | Would love to actively contribute | Web scraping, Data set construction, General Coding, model building, ml ops?
43
Tyler Kolody | | | I am a Canadian CS MSc student working on materials generation using diffusion and topological data analysis. | Materials Informatics and associated libraries, NLP, generative modeling, Data engineering, web scraping | More interested in observing, but am happy to contribute to the components that overlap with my own research. | General consulting/discussion contribution, web scraping, data representation
44
Wuhao Chen | | | Master Student in AI at Imperial | ML, DL, PyTorch, Generative models, GNN | Would love to contribute now | Model implementation, training, and experimentation, Exploratory dataset analysis and processing, Refactoring the UNET bit diffusion code, Explore other architectures for denoising
45
Geo Jolly | | | Bachelor student of Computer Science, researcher @ucsc and @amrita | DL, RL, Generative Models, PyTorch | would like to contribute | Refactoring the UNET bit diffusion code; Training a VQ-VAE for DNA-sequences for stable diffusion
46
Karan Dahele | | | Soon to be MD. ML engineer/data scientist at drug discovery/genomics start-up. Applying generative models to genotype data | ML/DL (PyTorch), bioinformatics, medicine, writing | Active | Model building, protocols for evaluation/analysis of results, writing with biomedical context
47
Derek Neuland | | | data scientist, researcher @ Puzzle Labs | Data research, data collection, front-end development, design, project management | Very interested in being an active contributor, wherever I can help | Research, data analysis, collecting and organizing data, creating datasets, cleaning datasets, designing visual components, designing internal workflows, project management
48
Dmitry Penzar | | | PhD student, data scientist, bioinformatician | sequence-based expression prediction, SNV effect prediction, classical machine learning and deep learning | Interested in actively contributing. | model training, model architecture modifications, have several datasets for the evaluation
49
Jonas Grabbe | Jonas Grabbe#0649 | | data scientist, researcher @ Oxcitas | Machine Learning, Deep Learning, Geometric DL, Data Driven Drug Discovery | Observing for now. Interested in contributing once I have a bit more time. | | JonasGrabbe | x
50
Eeshit Dhaval Vaishnav | edv | yes | https://www.mit.edu/~edv/ | https://www.linkedin.com/in/1edv | Both | Anything necessary for the project. | 1edv | x
51
Nwanna Joseph | N.Code.Dev | | Data scientist, AI researcher, B.Tech in computer science, Masters in Machine Intelligence. Formerly Senior Android developer, currently Data scientist and Machine Intelligence researcher. Pytorch pro and Contributor to scikit learn | Machine learning, Deep learning, Transformers, Pytorch, Python | Code Contribution | Implement evaluation metrics | Nwanna-Joseph | x
52
Phung Cheng Fei | buttercutter#1033 | | Data engineer | Pytorch | Would love to actively contribute wherever needed! | Anything necessary for the project. | buttercutter | x
53
Ifty Mohammad Rezwan | Ifty#5354 | | Machine Learning Engineer @ Neovotech, Research Assistant @ NSU | Deep Learning (General CV and NLP), Python Scripting, Basic API design | Active contribution | Modelling, Data Gathering and DataLoaders and Scripting if necessary. For now I can try the task (Integrate Maximal Update Parametrization (MuP) for hyperparameter tuning). Seems to be the one open on the board. Can also work on upgrading core models a bit | imr555
54
Francesco Sacco | AScaccoLad | | Just graduated with a master's in physics | Physics, BioML, Pytorch | I wish to be an active contributor | Modelling | Francesco215 | x
55
Apoorva Srinivasan | Apoorva | yes | Data scientist, MS in biostats @Columbia University | ML, GANS, statistics, proteomics | Active contributor | | apoorvasrinivasan26
56
Noah Weber | noahweber | yes | CTO @ CelerisTx | All things ML & Data | Active | Codebase; development and the Algo research | noahweber1
57
Szilard Polgar | Szilard#9346 | yes | data engineer in finance - biochemistry/ML enthusiast | data pipelines, ML | Active from December | | szilapo
58
Cheyenne Ziegler | cheyziggy | yes | PhD candidate in comp bio @ UTD, ML/AI Engineer @ EMD Serono | BioML, anything software | Active | Currently working on unit test, really can contribute to anything | ceziegler
59
Oliver Nash | orinoco | Yes | PhD Candidate @ UCL in Engineering | ML, Data, Genomics, Product Management, Policy, Safety | Both | | olivernash
60
Peter Clarke | resurgo#1669 | Yes | AI/genomics startup CEO/researcher >15 years bioinformatics @sanger, Cambridge Uni, Caltech | PyTorch, Bioinformatics, RNA, Chromatin | Previous observer, now wanting to contribute | | fourpartswater
61
Albert Wang | bertie | Yes | Currently building with EF, undergrad at McMaster University (biomedical engineering, health sciences, math) | Beginner, self-taught ML / AI. Python, MATLAB. | Active observer - would love to build skills to a level that I can contribute. | Perhaps I can help review papers and apply product management skills while learning technical skills? | albertyqw
62
Kieran Didi | Kieran Didi | Yes | | | Active contributor | | kierandidi
63
64
Gabriel Dolsten | gaeb | Yes | PhD Student at Princeton | Genomics/ML | Active contributor
65
Younwoo (Ethan) Choi | fdejong | Yes | CS undergrad @ University of Toronto, student researcher @ Vector Institute | ML, NLP, PyTorch | Active contributor | | younwoochoi | x
66
Zelun Li | l-z-l.m | YES | Bioinformatics Honours student @ UNSW @ Wong lab (Victor Chang) | PyTorch, Bioinformatics, epigenetics | Active contributor | | l-z-l
67
Aneeqa Fatima | daisies | Yes | CS @ University of Michigan, Master in Applied Math @ University of Washington, Senior SWE @ MSFT on high performance model optimization and kernel development for ASICs | ML, PyTorch, C++, HPC | Observer for now
68
Arnab Bhattacharya | Arnab | Yes | Software Developer (Frontend, Backend microservices, DL, ML) | Pytorch, Tensorflow, React, Node.js, Rust, Go | Active contributor | DNA Embedding, API and User facing apps | arnab28122000
69
Tian-Lai (Leo) Zang | TZ | Yes | BME @ Duke, previously worked on diffusion models | Pytorch, DL, NLP, Protein Design, Microbiology | Wish to be an active contributor | DNA Embedding, Guided Diffusion
70
Xuan He | chord233 | | bachelor student in CS, interested in AI4Science | Pytorch, Tensorflow, AI4Science | Observer for now, would like to contribute | | chord233
71
Shaishav Jain | dragonSlayer01#6399 | Yes | Associate Data Science Consultant @ ZS Associates | ML/AI, Pytorch, NLP, Transformers, Stable Diffusion, Bioinformatics | Wish to be an active contributor | Would love to work on the following tasks: Consider DNA embedding models to transition to use stable diffusion, Implement a sequence quality metric based on existing neural networks that can predict enhancer activity (BERT-based), expression (Enformer) or chromatin accessibility (BPNet) | dragonslayer01
72
Talha Khan | talk24#4274 | Yes | | ML/AI, Pytorch, Tensorflow, AWS, Web | active contributor | | talkhanz
73
Michael Leone | mjleone#1505 | Yes | MD/PhD Student at Carnegie Mellon in Andreas Pfenning Lab | ML, Tensorflow, Epigenomics, snATAC-seq analysis | observer, want to contribute