Pf7: an open dataset of Plasmodium falciparum genome variation in 20,000 worldwide samples

Muzamil Mahdi Abdel Hamid, Mohamed Hassan Abdelraheem, Desmond Omane Acheampong, Ambroise Ahouidi, Mozam Ali, Jacob Almagro-Garcia, Alfred Amambua-Ngwa, Chanaki Amaratunga, Lucas Amenga-Etego, Ben Andagalu, Tim Anderson, Voahangy Andrianaranjaka, Ifeyinwa Aniebo, Enoch Aninagyei, Felix Ansah, Patrick O. Ansah, Tobias Apinjoh, Paulo Arnaldo, Elizabeth Ashley, Sarah Auburn, Gordon A. Awandare, Hampate Ba, Vito Baraka, Alyssa Barry, Philip Bejon, Gwladys I. Bertin, Maciej F. Boni, Steffen Borrmann, Teun Bousema, Marielle Bouyou-Akotet, Oralee Branch, Peter C. Bull, Huch Cheah, Keobouphaphone Chindavongsa, Thanat Chookajorn, Kesinee Chotivanich, Antoine Claessens, David J. Conway, Vladimir Corredor, Erin Courtier, Alister Craig, Umberto D'Alessandro, Souleymane Dama, Nicholas Day, Brigitte Denis, Mehul Dhorda, Mahamadou Diakite, Abdoulaye Djimde, Christiane Dolecek, Arjen Dondorp, Seydou Doumbia, Chris Drakeley, Eleanor Drury, Patrick Duffy, Diego F. Echeverry, Thomas G. Egwang, Sonia Maria Mauricio Enosse, Berhanu Erko, Rick M. Fairhurst, Abdul Faiz, Caterina A. Fanello, Mark Fleharty, Matthew Forbes, Mark Fukuda, Dionicia Gamboa, Anita Ghansah, Lemu Golassa, Sonia Goncalves, G. L. Abby Harrison, Sara Anne Healy, Jason A. Hendry, Anastasia Hernandez-Koutoucheva, Tran Tinh Hien, Catherine A. Hill, Francis Hombhanje, Amanda Hott, Ye Htut, Mazza Hussein, Mallika Imwong, Deus Ishengoma, Scott A. Jackson, Chris G. Jacob, Julia Jeans, Kimberly J. Johnson, Claire Kamaliddin, Edwin Kamau, Jon Keatley, Theerarat Kochakarn, Drissa S. Konate, Abibatou Konaté, Aminatou Kone, Dominic P. Kwiatkowski, Myat P. Kyaw, Dennis Kyle, Mara Lawniczak, Samuel K. Lee, Martha Lemnge, Pharath Lim, Chanthap Lon, Kovana M. Loua, Celine I. Mandara, Jutta Marfurt, Kevin Marsh, Richard James Maude, Mayfong Mayxay, Oumou Maïga-Ascofaré, Olivo Miotto, Toshihiro Mita, Victor Mobegi, Abdelrahim Osman Mohamed, Olugbenga A. Mokuolu, Jaqui Montgomery, Collins Misita Morang’a, Ivo Mueller, Kathryn Murie, Paul N. Newton, Thang Ngo Duc, Thuy Nguyen, Thuy-Nhien Nguyen, Tuyen Nguyen Thi Kim, Hong Nguyen van, Harald Noedl, Francois Nosten, Rintis Noviyanti, Vincent Ntui-Njock Ntui, Alexis Nzila, Lynette Isabella Ochola-Oyier, Harold Ocholla, Abraham Oduro, Irene Omedo, Marie A. Onyamboko, Jean-Bosco Ouedraogo, Kolapo Oyebola, Wellington Aghoghovwia Oyibo, Richard Pearson, Norbert Peshu, Aung P. Phyo, Christopher V. Plowe, Ric N. Price, Sasithon Pukrittayakamee, Huynh Hong Quang, Milijaona Randrianarivelojosia, Julian C. Rayner, Pascal Ringwald, Anna Rosanas-Urgell, Eduard Rovira-Vallbona, Valentin Ruano-Rubio, Lastenia Ruiz, David Saunders, Alex Shayo, Peter Siba, Victoria J. Simpson, Mahamadou S. Sissoko, Christen Smith, Xin-Zhuan Su, Colin Sutherland, Shannon Takala-Harrison, Arthur Talman, Livingstone Tavul, Ngo Viet Thanh, Vandana Thathy, Aung Myint Thu, Mahamoudou Toure, Antoinette Tshefu, Federica Verra, Joseph Vinetz, Thomas E. Wellems, Jason Wendler, Nicholas J. White, Georgia Whitton, William Yavo, Rob W. van der Pluijm

Research output: Contribution to journal › Article › Academic › peer-review

13 Citations (Scopus)

Abstract

We describe the MalariaGEN Pf7 data resource, the seventh release of Plasmodium falciparum genome variation data from the MalariaGEN network. It comprises over 20,000 samples from 82 partner studies in 33 countries, including several malaria-endemic regions that were previously underrepresented. For the first time we include dried blood spot samples that were sequenced after selective whole genome amplification, necessitating new methods to genotype copy number variations. We identify a large number of newly emerging crt mutations in parts of Southeast Asia, and show examples of heterogeneities in patterns of drug resistance within Africa and within the Indian subcontinent. We describe the profile of variations in the C-terminal region of the csp gene and relate this to the sequence used in the RTS,S and R21 malaria vaccines. Pf7 provides high-quality data on genotype calls for 6 million SNPs and short indels, analysis of large deletions that cause failure of rapid diagnostic tests, and systematic characterisation of six major drug resistance loci, all of which can be freely downloaded from the MalariaGEN website.
Original language: English
Article number: 22
Journal: Wellcome Open Research
Volume: 8
Publication status: Published - 2023
Externally published: Yes

Keywords

  • data resource
  • genomic epidemiology
  • genomics
  • malaria
  • Plasmodium falciparum
