MicroBinfie Podcast, 82 Bioinformatics moments before the millennium - MicroBinfie Podcast

MicroBinfie Podcast, 82 Bioinformatics moments before the millennium

In this episode we talk to Professor Mark Pallen, who discusses the highlights from his long career as a medical microbiologist turned bioinformatician.

His bioinformatics journey began in 1977, the year Fred Sanger invented DNA sequencing-as-we know-it, when Mark was tasked with assembling some amino acid sequences under exam conditions. Mark explains how little was know about sequences at the time. Luckily he managed to gain a grasp of molecular biology and joined a group in the late 1908s at Barts Hospital in London, where he met Brendan Wren. Mark’s first eureka moment followed shortly afterwards, when he analysed sequences encoding the key enzyme urease from Helicobacter pylori. He also got very excited when he analysed genes from a clostridial butanol fermentation pathway, which he explains, played a central role in the formation of the state of Israel. His next big break came when he got the chance to do a PhD under Gordon Dougan. During this time, Mark not only improved his lab and bioinformatics skills, but captained a winning team in University Challenge and introduced the medical profession to the Internet. He recalls with excitement the moment when he first heard the news that a bacterial genome had been sequenced. Shortly afterwards he recruited an 18-year old gap year student, Nick Loman, to come and work with him analysing the very first Campylobacter jejuni genome. We close this episode just as the new millennium begins, with much more excitement to follow in the next episode.

Relevant links:

  • Butanol - https://academic.oup.com/femsle/article/124/1/61/486499
  • Tree-like thinking for genes, languages and gospel manuscripts - https://www.youtube.com/watch?v=8Ykj5wQs7vU

Selective bibliography

  • Konings, F, Perkins, MD, Kuhn, JH, Pallen, MJ, Alm, EJ, Archer, BN, Barakat, A, Bedford, T, Bhiman, JN, Caly, L, Carter, LL, Cullinane, A, de Oliveira, T, Druce, J, El Masry, I, Evans, R, Gao, GF, Gorbalenya, AE, Hamblion, E, Herring, BL, Hodcroft, E, Holmes, EC, Kakkar, M, Khare, S, Koopmans, MPG, Korber, B, Leite, J, MacCannell, D, Marklewitz, M, Maurer-Stroh, S, Rico, JAM, Munster, VJ, Neher, R, Munnink, BO, Pavlin, BI, Peiris, M, Poon, L, Pybus, O, Rambaut, A, Resende, P, Subissi, L, Thiel, V, Tong, S, van der Werf, S, von Gottberg, A, Ziebuhr, J, Van Kerkhove, MD (2021) SARS-CoV-2 Variants of Interest and Concern naming scheme conducive for global discourse Nature Microbiology 6 (7) 821 823
  • Pallen, MJ, Alikhan, NF (2021) Naming the Unnamed: Over 45,000 Candidatus Names for Unnamed Archaea and Bacteria in the Genome Taxonomy Database Preprints https://doi.org/10.20944/preprints2021110557v1
  • Pallen MJ (2021) Bacterial nomenclature in the era of genomics New Microbes New Infect 44 100942 https://doi.org/10.1016/jnmni2021100942
  • Pallen, MJ, Telatin, A, Oren, A (2021) The Next Million Names for Archaea and Bacteria Trends in Microbiology 29 (4) 289 298 https://doi.org/10.1016/jtim202010009
  • Gilroy, R, Ravi, A, Getino, M, Pursley, I, Horton, DL, Alikhan, N-F, Baker, D, Gharbi, K, Hall, N, Watson, M, Adriaenssens, EM, Foster-Nyarko, E, Jarju, S, Secka, A, Antonio, M, Oren, A, Chaudhuri, RR, Ragione, RL, Hildebrand, F, Pallen, MJ (2021) Extensive microbial diversity within the chicken gut microbiome revealed by metagenomics and culture PeerJ 9 e10941 https://doi.org/10.7717/peerj10941
  • Ravi, A, Halstead, FD, Bamford, A, Casey, A, Thomson, NM, Van Schaik, W, Snelson, C, Goulden, R, Foster-Nyarko, E, Savva, GM, Whitehouse, T, Pallen, MJ, Oppenheim, BA (2019) Loss of microbial diversity and pathogen domination of the gut microbiota in critically ill patients Microbial Genomics 5 (9) 293 https://doi.org/10.1099/mgen0000293
  • Connor, TR, Loman, NJ, Thompson, S, Smith, A, Southgate, J, Poplawski, R, Bull, MJ, Richardson, E, Ismail, M, Thompson, SE, Kitchen, C, Guest, M, Bakke, M, Sheppard, SK, Pallen, MJ (2016) CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community Microbial genomics 2 (9) e000086 https://doi.org/10.1099/mgen0000086
  • Loman NJ, Pallen MJ (2015) Twenty years of bacterial genome sequencing. Nat Rev Microbiol 13:787–794. https://doi.org/10.1038/nrmicro3565
  • Smith O, Momber G, Bates R, Garwood P, Fitch S, Pallen M*, Gaffney V, Allaby RG (2015) Sedimentary DNA from a submerged site reveals wheat in the British Isles 8000 years ago. Science 347:998–1001. https://doi.org/10.1126/science.1261278
  • Kay GL, Sergeant MJ, Zhou Z, Chan JZ, Millard A, Quick J, Szikossy I, Pap I, Spigelman M, Loman NJ, Achtman M, Donoghue HD, Pallen MJ (2015) Eighteenth-century genomes show that mixed infections were common at time of peak tuberculosis in Europe. Nat
  • Halachev MR, Chan JZ, Constantinidou CI, Cumley N, Bradley C, Smith-Banks M, Oppenheim B, Pallen MJ (2014) Genomic epidemiology of a protracted hospital outbreak caused by multidrug-resistant Acinetobacter baumannii in Birmingham, England. Genome Med 6:70. https://doi.org/10.1186/s13073-014-0070-x
  • Sergeant MJ, Constantinidou C, Cogan TA, Bedford MR, Penn CW, Pallen MJ (2014) Extensive microbial and functional diversity within the chicken cecal microbiome. PLoS One 9:e91941. https://doi.org/10.1371/journal.pone.0091941
  • Pallen MJ (2014) Diagnostic metagenomics: potential applications to bacterial, viral and parasitic infections. Parasitology 141:1856–1862. https://doi.org/10.1017/S0031182014000134 Citations: 103
  • Doughty EL, Sergeant MJ, Adetifa I, Antonio M, Pallen MJ (2014) Culture-independent detection and characterisation of Mycobacterium tuberculosis and M. africanum in sputum samples using shotgun metagenomics on a benchtop sequencer. PeerJ 2:e585. https://doi.org/10.7717/peerj.585
  • Chan JZ, Sergeant MJ, Lee OY, Minnikin DE, Besra GS, Pap I, Spigelman M, Donoghue HD, Pallen MJ (2013) Metagenomic analysis of tuberculosis in a mummy. N Engl J Med 369(3):289–290. https://doi.org/10.1056/NEJMc1302295
  • Robinson ER, Walker TM, Pallen MJ (2013) Genomics and outbreak investigation: from sequence to consequence. Genome Med 5:36. https://doi.org/10.1186/gm440
  • Loman NJ, Constantinidou C, Christner M, Rohde H, Chan JZ, Quick J, Weir JC, Quince C, Smith GP, Betley JR, Aepfelbacher M, Pallen MJ (2013) A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of Shiga-toxigenic Escherichia coli O104:H4. JAMA 309:1502–1510. https://doi.org/10.1001/jama.2013.3231
  • Loman NJ, Misra RV, Dallman TJ, Constantinidou C, Gharbia SE, Wain J, Pallen MJ (2012) Performance comparison of benchtop high-throughput sequencing platforms. Nat Biotechnol 30:434–439. https://doi.org/10.1038/nbt.2198
  • Loman NJ, Constantinidou C, Chan JZ, Halachev M, Sergeant M, Penn CW, Robinson ER, Pallen MJ (2012) High-throughput bacterial genome sequencing: an embarrassment of choice, a world of opportunity. Nat Rev Microbiol 10:599–606. https://doi.org/10.1038/nrmicro2850
  • Chan JZ, Pallen MJ, Oppenheim B, Constantinidou C (2012) Genome sequencing in clinical microbiology. Nat Biotechnol 30:1068–1071. https://doi.org/10.1038/nbt.2410
  • Chan JZ, Halachev MR, Loman NJ, Constantinidou C, Pallen MJ (2012) Defining bacterial species in the genomic era: insights from the genus Acinetobacter. BMC Microbiol 12:302. https://doi.org/10.1186/1471-2180-12-302
  • Mol Microbiol 79:468–483. https://doi.org/10.1111/j.1365-2958.2010.07460.x
  • Rohde H, Qin J, Cui Y, Li D, Loman NJ, Hentschke M, Chen W, Pu F, Peng Y, Li J, Xi F, Li S, Li Y, Zhang Z, Yang X, Zhao M, Wang P, Guan Y, Cen Z, Zhao X, Christner M, Kobbe R, Loos S, Oh J, Yang L, Danchin A, Gao GF, Song Y, Li Y, Yang H, Wang J, Xu J, Pallen MJ*, Wang J, Aepfelbacher M, Yang R (2011) Open-source genomic analysis of Shiga-toxin-producing E. coli O104:H4. N Engl J Med 365:718–724. https://doi.org/10.1056/NEJMoa1107643
  • Lewis T, Loman NJ, Bingle L, Jumaa P, Weinstock GM, Mortiboy D, Pallen MJ (2010) High-throughput whole-genome sequencing to dissect the epidemiology of Acinetobacter baumannii isolates from a hospital outbreak. J Hosp Infect 75:37–41. https://doi.org/10.1016/j.jhin.2010.01.012
  • Pallen MJ (2010) High-throughput sequencing as a tool in the discovery, diagnosis and epidemiology of nosocomial pathogens. Journal of Hospital Infection 76:S1. https://doi.org/10.1016/S0195-6701(10)00376-2
  • Pallen MJ, Loman NJ, Penn CW (2010) High-throughput sequencing and clinical microbiology: progress, opportunities and challenges. Curr Opin Microbiol 13:625–631. https://doi.org/10.1016/j.mib.2010.08.003
  • Pallen MJ, Nelson KE, Preston GM (2007) Bacterial Pathogenomics (ASM Press, Washington, DC, [Edited Book]
  • Pallen MJ (2007) Introduction to Pathogenomics in Bacterial Pathogenomics, eds Pallen
  • Tobe T, Beatson SA, Taniguchi H, Abe H, Bailey CM, Fivian A, Younis R, Matthews S, Marches O, Frankel G, Hayashi T, Pallen MJ (2006) An extensive repertoire of type III secretion effectors in Escherichia coli O157 and the role of lambdoid phages in their dissemination. Proc Natl Acad Sci U S A 103:14941–14946. https://doi.org/10.1073/pnas.0604891103
  • Pallen MJ, Beatson SA, Bailey CM (2005) Bioinformatics, genomics and evolution of non-flagellar type-III secretion systems: a Darwinian perspective. FEMS Microbiol Rev 29:201–229. https://doi.org/10.1016/j.femsre.2005.01.001
  • Pallen MJ, Beatson SA, Bailey CM (2005) Bioinformatics analysis of the locus for enterocyte effacement provides novel insights into type-III secretion. BMC Microbiol 5:9. https://doi.org/10.1186/1471-2180-5-9
  • Pallen MJ, Penn CW, Chaudhuri RR (2005) Bacterial flagellar diversity in the post-genomic era. Trends in Microbiology 13:143–149. https://doi.org/10.1016/j.tim.2005.02.008
  • Ideses D, Gophna U, Paitan Y, Chaudhuri RR, Pallen MJ, Ron EZ (2005) A degenerate type III secretion system from septicemic Escherichia coli contributes to pathogenesis. J Bacteriol 187:8164–8171. https://doi.org/10.1128/JB.187.23.8164-8171.2005
  • Ren CP, Chaudhuri RR, Fivian A, Bailey CM, Antonio M, Barnes WM, Pallen MJ (2004) The ETT2 gene cluster, encoding a second type III secretion system from Escherichia coli, is present in the majority of strains but has undergone widespread mutational attrition. J Bacteriol 186:3547–3560. https://doi.org/10.1128/JB.186.11.3547-3560.2004
  • Zhang L, Chaudhuri RR, Constantinidou C, Hobman JL, Patel MD, Jones AC, Sarti D, Roe AJ, Vlisidou I, Shaw RK, Falciani F, Stevens MP, Gally DL, Knutton S, Frankel G, Penn CW, Pallen MJ (2004) Regulators encoded in the Escherichia coli type III secretion system 2 gene cluster influence expression of genes within the locus for enterocyte effacement in enterohemorrhagic E. coli O157:H7. Infect Immun 72:7282–7293. https://doi.org/10.1128/IAI.72.12.7282-7293.2004
  • Chaudhuri RR, Khan AM, Pallen MJ (2004) coliBASE: an online database for Escherichia coli, Shigella and Salmonella comparative genomics. Nucleic Acids Res 32:D296–9. https://doi.org/10.1093/nar/gkh031
  • Cerdeno-Tarraga AM, Efstratiou A, Dover LG, Holden MT, Pallen M, Bentley SD, Besra GS, Churcher C, James KD, De Zoysa A, Chillingworth T, Cronin A, Dowd L, Feltwell T, Hamlin N, Holroyd S, Jagels K, Moule S, Quail MA, Rabbinowitsch E, Rutherford KM, Thomson NR, Unwin L, Whitehead S, Barrell BG, Parkhill J (2003) The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129. Nucleic Acids Res 31:6516–6523. https://doi.org/10.1093/nar/gkg874
  • Pallen MJ, Francis MS, Futterer K (2003) Tetratricopeptide-like repeats in type-III-secretion chaperones and regulators. FEMS Microbiol Lett 223:53–60. https://doi.org/10.1016/S0378-1097(03)00344-6
  • Bentley SD, Maiwald M, Murphy LD, Pallen MJ, Yeats CA, Dover LG, Norbertczak HT, Besra GS, Quail MA, Harris DE, von Herbay A, Goble A, Rutter S, Squares R, Squares S, Barrell BG, Parkhill J, Relman DA (2003) Sequencing and analysis of the genome of the Whipple’s disease bacterium Tropheryma whipplei. Lancet 361:637–644. https://doi.org/10.1016/S0140-6736(03)12597-4
  • Roche FM, Massey R, Peacock SJ, Day NP, Visai L, Speziale P, Lam A, Pallen M, Foster TJ (2003) Characterization of novel LPXTG-containing proteins of Staphylococcus aureus identified from genome sequences. Microbiology 149:643–654. https://doi.org/10.1099/mic.0.25996-0
  • Pallen MJ (2002) The ESAT-6/WXG100 superfamily – and a new Gram-positive secretion system? Trends in Microbiology 10:209–212. https://doi.org/10.1016/S0966-842X(02)02345-4
  • Spiers A, Lamb HK, Cocklin S, Wheeler KA, Budworth J, Dodds AL, Pallen MJ, Maskell DJ, Charles IG, Hawkins AR (2002) PDZ domains facilitate binding of high temperature requirement protease A (HtrA) and tail-specific protease (Tsp) to heterologous substrates through recognition of the small stable RNA A (ssrA)-encoded peptide. J Biol Chem 277:39443–39449. https://doi.org/10.1074/jbc.M202790200
  • Pallen M (2002) From sequence to consequence: In silico hypothesis generation and testing. Methods in Microbiology 33:27–48. https://doi.org/10.1016/S0580-9517(02)33003-4
  • Pallen M, Chaudhuri R, Khan A (2002) Bacterial FHA domains: neglected players in the phospho-threonine signalling game? Trends in Microbiology 10:556–563. https://doi.org/10.1016/S0966-842X(02)02476-9
  • Pallen MJ, Lam AC, Loman N (2001) Tricorn-like proteases in bacteria. Trends in Microbiology 9:518–521. https://doi.org/10.1016/S0966-842X(01)02199-0
  • Pallen MJ, Lam AC, Antonio M, Dunbar K (2001) An embarrassment of sortases - a richness of substrates? Trends in Microbiology 9:97–102. https://doi.org/10.1016/S0966-842X(01)01956-4
  • Pallen MJ, Lam AC, Loman NJ, McBride A (2001) An abundance of bacterial ADP-ribosyltransferases–implications for the origin of exotoxins and their human homologues. Trends in Microbiology 9:302–7; discussion 308. https://doi.org/10.1016/S0966-842X(01)02074-1
  • Parkhill J, Wren BW, Mungall K, Ketley JM, Churcher C, Basham D, Chillingworth T, Davies RM, Feltwell T, Holroyd S, Jagels K, Karlyshev AV, Moule S, Pallen MJ, Penn CW, Quail MA, Rajandream MA, Rutherford KM, van Vliet AH, Whitehead S, Barrell BG (2000) The genome sequence of the food-borne pathogen Campylobacter jejuni reveals hypervariable sequences. Nature 403:665–668. https://doi.org/10.1038/35001088
  • Ponting CP, Pallen MJ (1999) beta-propeller repeats and a PDZ domain in the tricorn protease: predicted self-compartmentalisation and C-terminal polypeptide-binding strategies of substrate selection. FEMS Microbiol Lett 179:447–451.
  • Ponting CP, Pallen MJ (1999) A beta-propeller domain within TolB. Mol Microbiol 31:739–740. https://doi.org/10.1046/j.1365-2958.1999.01168.x
  • Pallen MJ, Ponting CP (1997) PDZ domains in bacterial proteins. Mol Microbiol 26(2):411–413. https://doi.org/10.1046/j.1365-2958.1997.5591911.x
  • Pallen MJ (1996) Medicine and the Internet: dreams, nightmares and reality. Br J Hosp Med 56:506–509.
  • Pallen M, Kiley R (1996) Medical Information on the Internet. British Medical Journal 313:1411. https://doi.org/10.1136/bmj.313.7069.1411a
  • Mullany P, Pallen M, Wilks M, Stephen JR, Tabaqchali S (1996) A group II intron in a conjugative transposon from the gram-positive bacterium, Clostridium difficile. Gene 174:145–150. https://doi.org/10.1016/0378-1119(96)00511-2
  • Pallen MJ (1995) Medicine and the internet: dreams, nightmares and reality. British journal of hospital medicine 56:506–509.
  • Pallen M (1995) Guide to the Internet: Introducing the Internet. British Medical Journal 311:1422–1424. https://doi.org/10.1136/bmj.311.7017.1422
  • Pallen M (1995) Guide to the Internet: Electronic mail. British Medical Journal 311:1487–1490. https://doi.org/10.1136/bmj.311.7018.1487
  • Pallen M (1995) Guide to the Internet. The world wide web. British Medical Journal 311:1552–1556. https://doi.org/10.1136/bmj.311.7019.1552
  • Pallen M (1995) Guide to the Internet. Logging in, fetching files, reading news. British Medical Journal 311:1626–1630. https://doi.org/10.1136/bmj.311.7020.1626
  • Pallen MJ, Hay AJ, Puckey LH, Efstratiou A (1994) Polymerase chain reaction for screening clinical isolates of corynebacteria for the production of diphtheria toxin. J Clin Pathol 47:353–356. https://doi.org/10.1136/jcp.47.4.353
  • Mullany P, Clayton CL, Pallen MJ, Slone R, al-Saleh A, Tabaqchali S (1994) Genes encoding homologues of three consecutive enzymes in the butyrate/butanol-producing pathway of Clostridium acetobutylicum are clustered on the Clostridium difficile chromosome. FEMS Microbiol Lett 124:61–67. https://doi.org/10.1111/j.1574-6968.1994.tb07262.x
  • Clayton CL, Pallen MJ, Kleanthous H, Wren BW, Tabaqchali S (1990) Nucleotide sequence of two genes from Helicobacter pylori encoding for urease subunits. Nucleic Acids Res 18:362. https://doi.org/10.1093/nar/18.2.362