Play all audios:
ABSTRACT Lakes are heterogeneous ecosystems inhabited by a rich microbiome whose genomic diversity is poorly defined. We present a continental-scale study of metagenomes representing 6.5
million km2 of the most lake-rich landscape on Earth. Analysis of 308 Canadian lakes resulted in a metagenome-assembled genome (MAG) catalogue of 1,008 mostly novel bacterial genomospecies.
Lake trophic state was a leading driver of taxonomic and functional diversity among MAG assemblages, reflecting the responses of communities profiled by 16S rRNA amplicons and gene-centric
metagenomics. Coupling the MAG catalogue with watershed geomatics revealed terrestrial influences of soils and land use on assemblages. Agriculture and human population density were drivers
of turnover, indicating detectable anthropogenic imprints on lake bacteria at the continental scale. The sensitivity of bacterial assemblages to human impact reinforces lakes as sentinels of
environmental change. Overall, the LakePulse MAG catalogue greatly expands the freshwater genomic landscape, advancing an integrative view of diversity across Earth’s microbiomes. Access
through your institution Buy or subscribe This is a preview of subscription content, access via your institution ACCESS OPTIONS Access through your institution Access Nature and 54 other
Nature Portfolio journals Get Nature+, our best-value online-access subscription $29.99 / 30 days cancel any time Learn more Subscribe to this journal Receive 12 digital issues and online
access to articles $119.00 per year only $9.92 per issue Learn more Buy this article * Purchase on SpringerLink * Instant access to full article PDF Buy now Prices may be subject to local
taxes which are calculated during checkout ADDITIONAL ACCESS OPTIONS: * Log in * Learn about institutional subscriptions * Read our FAQs * Contact customer support SIMILAR CONTENT BEING
VIEWED BY OTHERS DEPTH-DISCRETE METAGENOMICS REVEALS THE ROLES OF MICROBES IN BIOGEOCHEMICAL CYCLING IN THE TROPICAL FRESHWATER LAKE TANGANYIKA Article Open access 09 February 2021 TWO
DECADES OF BACTERIAL ECOLOGY AND EVOLUTION IN A FRESHWATER LAKE Article 03 January 2025 A GENOME AND GENE CATALOG OF THE AQUATIC MICROBIOMES OF THE TIBETAN PLATEAU Article Open access 16
February 2024 DATA AVAILABILITY Raw metagenome reads were archived in the European Nucleotide Archive under study accession PRJEB29238. Metagenome co-assemblies were deposited and annotated
at the Joint Genome Institute (JGI) Genomes OnLine Database (GOLD) under study accession Gs0136026 and analysis projects Ga0495746 (Boreal/Taiga Cordilleras), Ga0495744 (Montane Cordillera),
Ga0495745 (Pacific Maritime), Ga0495743 (Taiga Plains), Ga0485099 (Semi-Arid Plateaux), Ga0485102 (Boreal Plains), Ga0485100 (Prairies), Ga0364548 (Mixedwood Plains), Ga0373103 (Boreal
Shield), Ga0372599 (Atlantic Highlands) and Ga0372598 (Atlantic Maritime). MAGs from co-assemblies and associated annotations were deposited in Dryad112 and at the European Nucleotide
Archive under study accession PRJEB62834. Publicly available datasets used in this study include HydroLakes v.1.0 (https://www.hydrosheds.org/products/hydrolakes), SoilGrids250m
(https://www.soilgrids.org/), ERA5-Land (https://doi.org/10.24381/cds.e2161bac), Canada Ecozones v.5b (https://ccea-ccae.org/ecozones-downloads/), Rfam v.14.2
(https://ftp.ebi.ac.uk/pub/databases/Rfam/14.2/), Transporter Classification Database downloaded on 27 January 2021 (https://tcdb.org/public/tcdb), dbCAN2 HMMdb v.9
(https://bcb.unl.edu/dbCAN2/download/dbCAN-HMMdb-V9.txt), GTDB r95 (https://data.gtdb.ecogenomic.org/releases/release95/) and DADA2-formatted 16S rRNA gene sequences
(GTDB_bac120_arc122_ssu_r95.fa.gz; https://doi.org/10.5281/zenodo.3951383). Source data are provided with this paper. CODE AVAILABILITY Scripts associated with this study are available at
https://github.com/rebeccagarner/lakepulse_mags. CHANGE HISTORY * _ 11 OCTOBER 2023 A Correction to this paper has been published: https://doi.org/10.1038/s41564-023-01515-7 _ REFERENCES *
Newton, R. J., Jones, S. E., Eiler, A., McMahon, K. D. & Bertilsson, S. A guide to the natural history of freshwater lake bacteria. _Microbiol. Mol. Biol. Rev._ 75, 14–49 (2011). Article
CAS PubMed PubMed Central Google Scholar * Pernthaler, J. Competition and niche separation of pelagic bacteria in freshwater habitats. _Environ. Microbiol._ 19, 2133–2150 (2017).
Article CAS PubMed Google Scholar * Bertilsson, S. & Mehrshad, M. in _Encyclopedia of Inland Waters_ (eds Mehner, T. & Tockner, K.) 601–615 (Elsevier, 2022). * Kratz, T. K.,
MacIntyre, S. & Webster, K. E. in _Ecosystem Function in Heterogeneous Landscapes_ (eds Lovett, G. M. et al.) 329–347 (Springer, 2005). * Tranvik, L. J. et al. Lakes and reservoirs as
regulators of carbon cycling and climate. _Limnol. Oceanogr._ 54, 2298–2314 (2009). Article CAS Google Scholar * Williamson, C. E., Dodds, W., Kratz, T. K. & Palmer, M. A. Lakes and
streams as sentinels of environmental change in terrestrial and atmospheric processes. _Front. Ecol. Environ._ 6, 247–254 (2008). Article Google Scholar * Grossart, H., Massana, R.,
McMahon, K. D. & Walsh, D. A. Linking metagenomics to aquatic microbial ecology and biogeochemical cycles. _Limnol. Oceanogr._ 65, S2–S20 (2020). Article CAS Google Scholar *
O’Reilly, C. M. et al. Rapid and highly variable warming of lake surface waters around the globe. _Geophys. Res. Lett._ 42, 10773–10781 (2015). Google Scholar * Smith, V. H. &
Schindler, D. W. Eutrophication science: where do we go from here? _Trends Ecol. Evol._ 24, 201–207 (2009). Article PubMed Google Scholar * Jane, S. F. et al. Widespread deoxygenation of
temperate lakes. _Nature_ 594, 66–70 (2021). Article CAS PubMed Google Scholar * Dugan, H. A. et al. Lakes at risk of chloride contamination. _Environ. Sci. Technol._ 54, 6639–6650
(2020). Article CAS PubMed Google Scholar * Reid, A. J. et al. Emerging threats and persistent conservation challenges for freshwater biodiversity. _Biol. Rev._ 94, 849–873 (2019).
Article PubMed Google Scholar * Sheridan, E. A. et al. Plastic pollution fosters more microbial growth in lakes than natural organic matter. _Nat. Commun._ 13, 4175 (2022). Article CAS
PubMed PubMed Central Google Scholar * Tyson, G. W. et al. Community structure and metabolism through reconstruction of microbial genomes from the environment. _Nature_ 428, 37–43 (2004).
Article CAS PubMed Google Scholar * Hug, L. A. et al. A new view of the tree of life. _Nat. Microbiol._ 1, 16048 (2016). Article CAS PubMed Google Scholar * Cabello-Yeves, P. J. et
al. Genomes of novel microbial lineages assembled from the sub-ice waters of Lake Baikal. _Appl. Environ. Microbiol._ 84, e02132-17 (2018). Article PubMed Google Scholar * Okazaki, Y.,
Nakano, S., Toyoda, A. & Tamaki, H. Long-read-resolved, ecosystem-wide exploration of nucleotide and structural microdiversity of lake bacterioplankton genomes. _mSystems_ 7, e0043322
(2022). Article PubMed Google Scholar * Mehrshad, M. et al. Hidden in plain sight—highly abundant and diverse planktonic freshwater Chloroflexi. _Microbiome_ 6, 176 (2018). Article
PubMed PubMed Central Google Scholar * Arora-Williams, K. et al. Dynamics of microbial populations mediating biogeochemical cycling in a freshwater lake. _Microbiome_ 6, 165 (2018).
Article PubMed PubMed Central Google Scholar * Tran, P. Q. et al. Depth-discrete metagenomics reveals the roles of microbes in biogeochemical cycling in the tropical freshwater Lake
Tanganyika. _ISME J._ 15, 1971–1986 (2021). Article CAS PubMed PubMed Central Google Scholar * Rodriguez‐R, L. M., Tsementzi, D., Luo, C. & Konstantinidis, K. T. Iterative
subtractive binning of freshwater chronoseries metagenomes identifies over 400 novel species and their ecologic preferences. _Environ. Microbiol._ 22, 3394–3412 (2020). Article PubMed
Google Scholar * Kavagutti, V. S. et al. High-resolution metagenomic reconstruction of the freshwater spring bloom. _Microbiome_ 11, 15 (2023). Article PubMed PubMed Central Google
Scholar * Tully, B. J., Graham, E. D. & Heidelberg, J. F. The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans. _Sci. Data_ 5, 170203 (2018). Article
CAS PubMed PubMed Central Google Scholar * Anantharaman, K. et al. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. _Nat.
Commun._ 7, 13219 (2016). Article CAS PubMed PubMed Central Google Scholar * Almeida, A. et al. A new genomic blueprint of the human gut microbiota. _Nature_ 568, 499–504 (2019).
Article CAS PubMed PubMed Central Google Scholar * Nayfach, S., Shi, Z. J., Seshadri, R., Pollard, K. S. & Kyrpides, N. C. New insights from uncultivated genomes of the global human
gut microbiome. _Nature_ 568, 505–510 (2019). Article CAS PubMed PubMed Central Google Scholar * Pasolli, E. et al. Extensive unexplored human microbiome diversity revealed by over
150,000 genomes from metagenomes spanning age, geography, and lifestyle. _Cell_ 176, 649–662.e20 (2019). Article CAS PubMed PubMed Central Google Scholar * Nayfach, S. et al. A genomic
catalog of Earth’s microbiomes. _Nat. Biotechnol._ 39, 499–509 (2021). Article CAS PubMed Google Scholar * Buck, M. et al. Comprehensive dataset of shotgun metagenomes from oxygen
stratified freshwater lakes and ponds. _Sci. Data_ 8, 131 (2021). Article CAS PubMed PubMed Central Google Scholar * Messager, M. L., Lehner, B., Grill, G., Nedeva, I. & Schmitt, O.
Estimating the volume and age of water stored in global lakes using a geo-statistical approach. _Nat. Commun._ 7, 13603 (2016). Article CAS PubMed PubMed Central Google Scholar * Huot,
Y. et al. The NSERC Canadian Lake Pulse Network: a national assessment of lake health providing science for water management in a changing climate. _Sci. Total Environ._ 695, 133668 (2019).
Article CAS PubMed Google Scholar * Kraemer, S. A. et al. A large-scale assessment of lakes reveals a pervasive signal of land use on bacterial communities. _ISME J._ 14, 3011–3023
(2020). Article CAS PubMed PubMed Central Google Scholar * Wight, J., Varin, M.-P., Robertson, G. J., Huot, Y. & Lang, A. S. Microbiology in the field: construction and validation
of a portable incubator for real-time quantification of coliforms and other bacteria. _Front. Public Health_ 8, 607997 (2020). Article PubMed PubMed Central Google Scholar * Garner, R.
E., Gregory-Eaves, I. & Walsh, D. A. Sediment metagenomes as time capsules of lake microbiomes. _mSphere_ 5, e00512-20 (2020). Article CAS PubMed PubMed Central Google Scholar *
Garner, R. E. et al. Protist diversity and metabolic strategy in freshwater lakes are shaped by trophic state and watershed land use on a continental scale. _mSystems_ 7, e0031622 (2022).
Article PubMed Google Scholar * MacKeigan, P. W. et al. Comparing microscopy and DNA metabarcoding techniques for identifying cyanobacteria assemblages across hundreds of lakes. _Harmful
Algae_ 113, 102187 (2022). Article CAS PubMed Google Scholar * Oliva, A., Garner, R. E., Walsh, D. & Huot, Y. The occurrence of potentially pathogenic fungi and protists in Canadian
lakes predicted using geomatics, in situ and satellite-derived variables: towards a tele-epidemiological approach. _Water Res._ 209, 117935 (2021). Article PubMed Google Scholar *
Griffiths, K. et al. Pervasive changes in algal indicators since pre-industrial times: a paleolimnological study of changes in primary production and diatom assemblages from ~200 Canadian
lakes. _Sci. Total Environ._ 838, 155938 (2022). Article CAS PubMed Google Scholar * Kraemer, S. A., Barbosa da Costa, N., Oliva, A., Huot, Y. & Walsh, D. A. A resistome survey
across hundreds of freshwater bacterial communities reveals the impacts of veterinary and human antibiotics use. _Front. Microbiol._ 13, 995418 (2022). Article PubMed PubMed Central
Google Scholar * Oliva, A. et al. Geospatial analysis reveals a hotspot of fecal bacteria in Canadian prairie lakes linked to agricultural non-point sources. _Water Res._ 231, 119596
(2023). Article CAS PubMed Google Scholar * Sánchez Schacht, J. R., MacKeigan, P. W., Taranu, Z. E., Huot, Y. & Gregory-Eaves, I. Agricultural land use and lake morphology explain
substantial variation in water quality across Canada. Preprint at _bioRxiv_ https://doi.org/10.1101/2022.08.29.505280 (2023). * Albertsen, M. et al. Genome sequences of rare, uncultured
bacteria obtained by differential coverage binning of multiple metagenomes. _Nat. Biotechnol._ 31, 533–538 (2013). Article CAS PubMed Google Scholar * Bowers, R. M. et al. Minimum
information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. _Nat. Biotechnol._ 35, 725–731 (2017). Article CAS PubMed PubMed
Central Google Scholar * Fernández-Gómez, B. et al. Ecology of marine Bacteroidetes: a comparative genomics approach. _ISME J._ 7, 1026–1037 (2013). Article PubMed PubMed Central Google
Scholar * De Corte, D. et al. Metagenomic insights into zooplankton-associated bacterial communities. _Environ. Microbiol._ 20, 492–505 (2018). Article PubMed Google Scholar * Ezzedine,
J. A., Desdevises, Y. & Jacquet, S. Bdellovibrio and like organisms: current understanding and knowledge gaps of the smallest cellular hunters of the microbial world. _Crit. Rev.
Microbiol._ 48, 428–449 (2022). Article PubMed Google Scholar * Brown, C. T. et al.Unusual biology across a group comprising more than 15% of domain bacteria. _Nature_ 523, 208–211
(2015). Article CAS PubMed Google Scholar * Berg, I. A. Ecological aspects of the distribution of different autotrophic CO2 fixation pathways. _Appl. Environ. Microbiol._ 77, 1925–1936
(2011). Article CAS PubMed PubMed Central Google Scholar * He, C. et al. Genome-resolved metagenomics reveals site-specific diversity of episymbiotic CPR bacteria and DPANN archaea in
groundwater ecosystems. _Nat. Microbiol._ 6, 354–365 (2021). Article CAS PubMed PubMed Central Google Scholar * Tian, R. et al. Small and mighty: adaptation of superphylum
Patescibacteria to groundwater environment drives their genome simplicity. _Microbiome_ 8, 51 (2020). Article CAS PubMed PubMed Central Google Scholar * Ferrier, S., Manion, G., Elith,
J. & Richardson, K. Using generalized dissimilarity modelling to analyse and predict patterns of beta diversity in regional biodiversity assessment. _Divers. Distrib._ 13, 252–264
(2007). Article Google Scholar * Sunagawa, S. et al. Structure and function of the global ocean microbiome. _Science_ 348, 1261359 (2015). Article PubMed Google Scholar *
Delgado-Baquerizo, M. et al. A global atlas of the dominant bacteria found in soil. _Science_ 359, 320–325 (2018). Article CAS PubMed Google Scholar * Hahn, M. W., Huemer, A., Pitt, A.
& Hoetzinger, M. Opening a next‐generation black box: ecological trends for hundreds of species‐like taxa uncovered within a single bacterial >99% 16S rRNA operational taxonomic unit.
_Mol. Ecol. Resour._ 21, 2471–2485 (2021). Article CAS PubMed PubMed Central Google Scholar * Azan, S. S. E. & Arnott, S. E. The impact of calcium decline on population growth
rates of crustacean zooplankton in Canadian Shield lakes. _Limnol. Oceanogr._ 63, 602–616 (2018). Article CAS Google Scholar * Weyhenmeyer, G. A. et al. Widespread diminishing
anthropogenic effects on calcium in freshwaters. _Sci. Rep._ 9, 10450 (2019). Article PubMed PubMed Central Google Scholar * Kaushal, S. S. et al. Freshwater salinization syndrome on a
continental scale. _Proc. Natl Acad. Sci. USA_ 115, E574–E583 (2018). Article CAS PubMed PubMed Central Google Scholar * McKee, L. S. et al. Polysaccharide degradation by the
Bacteroidetes: mechanisms and nomenclature. _Environ. Microbiol. Rep._ 13, 559–581 (2021). Article CAS PubMed Google Scholar * He, S. et al. Ecophysiology of freshwater Verrucomicrobia
inferred from metagenome-assembled genomes. _mSphere_ 2, e00277-17 (2017). Article PubMed PubMed Central Google Scholar * Harding, C. J. et al. A lysozyme with altered substrate
specificity facilitates prey cell exit by the periplasmic predator _Bdellovibrio bacteriovorus_. _Nat. Commun._ 11, 4817 (2020). Article CAS PubMed PubMed Central Google Scholar *
Berlemont, R. & Martiny, A. C. Glycoside hydrolases across environmental microbial communities. _PLoS Comput. Biol._ 12, e1005300 (2016). Article PubMed PubMed Central Google Scholar
* Garron, M.-L. & Henrissat, B. The continuing expansion of CAZymes and their families. _Curr. Opin. Chem. Biol._ 53, 82–87 (2019). Article CAS PubMed Google Scholar * Delgado, A.
& Gómez, J. A. in _Principles of Agronomy for Sustainable Agriculture_ (eds Villalobos, F. J. & Fereres, E.) 15–26 (Springer, 2016). * Marmen, S. et al. The role of land use types
and water chemical properties in structuring the microbiomes of a connected lake system. _Front. Microbiol._ 11, 89 (2020). Article PubMed PubMed Central Google Scholar * Sperlea, T. et
al. The relationship between land cover and microbial community composition in European lakes. _Sci. Total Environ._ 825, 153732 (2022). Article CAS PubMed Google Scholar * Phale, P. S.,
Sharma, A. & Gautam, K. in _Pharmaceuticals and Personal Care Products: Waste Management and Treatment Technology_ (eds Prasad, M. N. V. et al.) 259–278 (Butterworth-Heinemann, 2019). *
Foley, J. A. et al. Global consequences of land use. _Science_ 309, 570–574 (2005). Article CAS PubMed Google Scholar * Kaushal, S. S. et al. Land use and climate variability amplify
carbon, nutrient, and contaminant pulses: a review with management implications. _J. Am. Water Resour. Assoc._ 50, 585–614 (2014). Article CAS Google Scholar * Salcher, M. M., Schaefle,
D., Kaspar, M., Neuenschwander, S. M. & Ghai, R. Evolution in action: habitat transition from sediment to the pelagial leads to genome streamlining in Methylophilaceae. _ISME J._ 13,
2764–2777 (2019). Article CAS PubMed PubMed Central Google Scholar * Biessy, L., Pearman, J. K., Waters, S., Vandergoes, M. J. & Wood, S. A. Metagenomic insights to the functional
potential of sediment microbial communities in freshwater lakes. _Metabarcoding Metagenom._ 6, e79265 (2022). Article Google Scholar * Linz, A. M. et al. Freshwater carbon and nutrient
cycles revealed through reconstructed population genomes. _PeerJ_ 6, e6075 (2018). Article PubMed PubMed Central Google Scholar * Neuenschwander, S. M., Ghai, R., Pernthaler, J. &
Salcher, M. M. Microdiversification in genome-streamlined ubiquitous freshwater Actinobacteria. _ISME J._ 12, 185–198 (2018). Article CAS PubMed Google Scholar * NSERC Canadian Lake
Pulse Network. _NSERC Canadian Lake Pulse Network Field Manual 2017–2018–2019 Surveys_ (eds Varin, M.-P. et al.) https://doi.org/10.17118/11143/18662 (Université de Sherbrooke, 2021). *
Hengl, T. et al. SoilGrids250m: global gridded soil information based on machine learning. _PLoS ONE_ 12, e0169748 (2017). Article PubMed PubMed Central Google Scholar * Muñoz Sabater,
J. _ERA5-Land Hourly Data from 1981 to Present_ (Copernicus Climate Change Service (C3S) Climate Data Store (CDS), 2019). * _Canadian Environmental Quality Guidelines. Canadian Water Quality
Guidelines for the Protection of Aquatic Life. Phosphorus: Canadian Guidance Framework for the Management of Freshwater Systems_ (Canadian Council of Ministers of the Environment, 2004). *
_Canada Ecozones_ (Canadian Council on Ecological Areas, 2014); https://ccea-ccae.org/ecozones-downloads/ * Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for
Illumina sequence data. _Bioinformatics_ 30, 2114–2120 (2014). Article CAS PubMed PubMed Central Google Scholar * Li, D. et al. MEGAHIT v1.0: a fast and scalable metagenome assembler
driven by advanced methodologies and community practices. _Methods_ 102, 3–11 (2016). Article CAS PubMed Google Scholar * Li, H. & Durbin, R. Fast and accurate short read alignment
with Burrows-Wheeler transform. _Bioinformatics_ 25, 1754–1760 (2009). Article CAS PubMed PubMed Central Google Scholar * Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for
robust and efficient genome reconstruction from metagenome assemblies. _PeerJ_ 7, e7359 (2019). Article PubMed PubMed Central Google Scholar * Parks, D. H., Imelfort, M., Skennerton, C.
T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. _Genome Res._ 25, 1043–1055 (2015). Article
CAS PubMed PubMed Central Google Scholar * Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved
genome recovery from metagenomes through de-replication. _ISME J._ 11, 2864–2868 (2017). Article CAS PubMed PubMed Central Google Scholar * Seemann, T. Prokka: rapid prokaryotic genome
annotation. _Bioinformatics_ 30, 2068–2069 (2014). Article CAS PubMed Google Scholar * Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site
identification. _BMC Bioinformatics_ 11, 119 (2010). Article PubMed PubMed Central Google Scholar * Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches.
_Bioinformatics_ 29, 2933–2935 (2013). Article CAS PubMed PubMed Central Google Scholar * Kalvari, I. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families.
_Nucleic Acids Res._ 49, D192–D200 (2021). Article CAS PubMed Google Scholar * Aramaki, T. et al.KofamKOALA: KEGG ortholog assignment based on profile HMM and adaptive score threshold.
_Bioinformatics_ 36, 2251–2252 (2020). Article CAS PubMed Google Scholar * Saier, M. H. et al. The Transporter Classification Database (TCDB): recent advances. _Nucleic Acids Res._ 44,
D372–D379 (2016). Article CAS PubMed Google Scholar * Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: interactive sequence similarity searching. _Nucleic Acids Res._ 39,
W29–W37 (2011). Article CAS PubMed PubMed Central Google Scholar * Zhang, H. et al. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. _Nucleic Acids Res._ 46,
W95–W101 (2018). Article CAS PubMed PubMed Central Google Scholar * Terrapon, N., Lombard, V., Gilbert, H. J. & Henrissat, B. Automatic prediction of polysaccharide utilization loci
in Bacteroidetes species. _Bioinformatics_ 31, 647–655 (2015). Article CAS PubMed Google Scholar * Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing
genomic features. _Bioinformatics_ 26, 841–842 (2010). Article CAS PubMed PubMed Central Google Scholar * Bushnell, B. BBMap (SourceForge, 2015). * Jain, C., Rodriguez-R, L. M.,
Phillippy, A. M., Konstantinidis, K. T. & Aluru, S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. _Nat. Commun._ 9, 5114 (2018). Article
PubMed PubMed Central Google Scholar * Li, H. et al. The Sequence Alignment/Map format and SAMtools. _Bioinformatics_ 25, 2078–2079 (2009). Article PubMed PubMed Central Google Scholar
* Nayfach, S. & Pollard, K. S. Average genome size estimation improves comparative metagenomics and sheds light on the functional ecology of the human microbiome. _Genome Biol._ 16, 51
(2015). Article PubMed PubMed Central Google Scholar * Castro, J. C. et al. imGLAD: accurate detection and quantification of target organisms in metagenomes. _PeerJ_ 6, e5882 (2018).
Article PubMed PubMed Central Google Scholar * Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database.
_Bioinformatics_ 36, 1925–1927 (2019). Article PubMed PubMed Central Google Scholar * Yu, G., Smith, D. K., Zhu, H., Guan, Y. & Lam, T. T. ggtree: an R package for visualization and
annotation of phylogenetic trees with their covariates and other associated data. _Methods Ecol. Evol._ 8, 28–36 (2017). Article Google Scholar * Nurk, S., Meleshko, D., Korobeynikov, A.
& Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. _Genome Res._ 27, 824–834 (2017). Article CAS PubMed PubMed Central Google Scholar * Huntemann, M. et al. The
standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4). _Stand. Genomic Sci._ 11, 17 (2016). Article PubMed PubMed Central Google Scholar * Caporaso, J. G.
et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. _Proc. Natl Acad. Sci. USA_ 108, 4516–4522 (2011). Article CAS PubMed Google Scholar *
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. _EMBnet J._ 17, 10–12 (2011). Article Google Scholar * Callahan, B. J. et al. DADA2: high-resolution
sample inference from Illumina amplicon data. _Nat. Methods_ 13, 581–583 (2016). Article CAS PubMed PubMed Central Google Scholar * Alishum, A. DADA2 formatted 16S rRNA gene sequences
for both bacteria & archaea. _Zenodo_ https://doi.org/10.5281/zenodo.3951383 (2020). * Oksanen, J. et al. vegan: an R package for community ecologists (CRAN, 2020). * Paradis, E. &
Schliep, K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. _Bioinformatics_ 35, 526–528 (2019). Article CAS PubMed Google Scholar * Fitzpatrick, M. C.
et al. gdm: generalized dissimilarity modeling (CRAN, 2021). * R Core Team. _R: A Language and Environment for Statistical Computing_ (R Foundation for Statistical Computing, 2021). *
Wickham, H. et al. Welcome to the tidyverse. _J. Open Source Softw._ 4, 1686 (2019). Article Google Scholar * Garner, R. et al. The LakePulse Metagenome-Assembled Genome catalogue. _Dryad_
https://doi.org/10.5061/dryad.zkh1893fs (2023). Download references ACKNOWLEDGEMENTS This study was funded by the NSERC Canadian LakePulse Network (Strategic Network Grant NETGP-479720) and
Canada Research Chairs held by D.A.W. and Y.H. R.E.G. and V.E.O. received scholarships from the NSERC CREATE ÉcoLac Training Program in Lake and Fluvial Ecology. R.E.G. also received
support from a Fonds de recherche du Québec _–_ Nature et technologies Doctoral Research Scholarship and the Stephen Bronfman Graduate Scholarship in Environmental Studies. The pan-Canadian
field campaigns were made possible through the immense commitment and efforts of the LakePulse sampling crews and support teams, and the cooperation and assistance of Indigenous groups,
municipal and park employees, lake associations and landowners. We thank the LakePulse researchers involved in the generation, management and quality control of the dataset, and acknowledge
contributions from J. Juric, P. MacKeigan, C. Paquette, M. Beaulieu, K. Griffiths, G. Potvin, B. Cremella, G. Diab, V. Fugère, J. Kim, A. Oliva and K. Velghe. We also thank W. Brookes, S.
Simpson and Compute Canada for bioinformatics technical support. AUTHOR INFORMATION AUTHORS AND AFFILIATIONS * Department of Biology, Concordia University, Montreal, Quebec, Canada Rebecca
E. Garner, Vera E. Onana & David A. Walsh * Groupe de recherche interuniversitaire en limnologie, Montreal, Quebec, Canada Rebecca E. Garner, Vera E. Onana, Yannick Huot & David A.
Walsh * Environment and Climate Change Canada, Montreal, Quebec, Canada Susanne A. Kraemer * Département de géomatique appliquée, Université de Sherbrooke, Sherbrooke, Quebec, Canada Maxime
Fradette, Marie-Pierre Varin & Yannick Huot Authors * Rebecca E. Garner View author publications You can also search for this author inPubMed Google Scholar * Susanne A. Kraemer View
author publications You can also search for this author inPubMed Google Scholar * Vera E. Onana View author publications You can also search for this author inPubMed Google Scholar * Maxime
Fradette View author publications You can also search for this author inPubMed Google Scholar * Marie-Pierre Varin View author publications You can also search for this author inPubMed
Google Scholar * Yannick Huot View author publications You can also search for this author inPubMed Google Scholar * David A. Walsh View author publications You can also search for this
author inPubMed Google Scholar CONTRIBUTIONS D.A.W. conceived the study and designed the analyses with R.E.G. R.E.G., S.A.K. and V.E.O. performed the bioinformatic analyses. M.F. performed
the geomatic analyses. M.-P.V. coordinated the LakePulse field campaigns and data quality control. Y.H. conceived and directed the LakePulse Network. R.E.G. and D.A.W. wrote the manuscript.
All authors read and improved the final manuscript. CORRESPONDING AUTHOR Correspondence to David A. Walsh. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests.
PEER REVIEW PEER REVIEW INFORMATION _Nature Microbiology_ thanks Luis Rodriguez-R, Helmut Bürgmann and the other, anonymous, reviewer(s) for their contribution to the peer review of this
work. Peer reviewer reports are available. ADDITIONAL INFORMATION PUBLISHER’S NOTE Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional
affiliations. EXTENDED DATA EXTENDED DATA FIG. 1 Project workflow illustrating the generation and analysis of the LakePulse MAG catalogue. EXTENDED DATA FIG. 2 Distributions of (A)
geography, (B) lake morphometry, (C) watershed soil, (D) land use, (E) climate, and (F) surface water physicochemistry variables. Colours represent ecozones and ecozone medians are indicated
by dashed lines. EXTENDED DATA FIG. 3 Number of MAGs generated within each ecozone co-assembly. EXTENDED DATA FIG. 4 Comparison of taxonomic diversity in MAG and 16S rRNA gene ASV datasets:
percentages of MAGs and ASVs assigned to (A) phyla and (B) orders. EXTENDED DATA FIG. 5 Phylum-level biogeographic distributions of MAGs. EXTENDED DATA FIG. 6 HIERARCHICAL CLUSTERING
(WARD’S LINKAGE) OF MAG PHYLUM-LEVEL TAXONOMIC ASSEMBLAGES. Letter symbols above each MAG assemblage (scaled to relative TAD80) represent ecozones and coloured squares represent lake
chlorophyll-_a_ concentration. EXTENDED DATA FIG. 7 PCoAs of (A) the taxonomic variation among ASV assemblages based on Jaccard dissimilarities and (B) the variation in metabolic gene
content among metagenomes based on Bray-Curtis dissimilarities. EXTENDED DATA FIG. 8 SIGNIFICANT PREDICTORS OF (A) TAXONOMIC AND FUNCTIONAL TURNOVER ACROSS MAG ASSEMBLAGES AND COMMUNITY
PROFILES AND (B) TURNOVER IN SPECIFIC METABOLIC FUNCTIONS ACROSS MAG ASSEMBLAGES. The height of bars represents the relative importance of predictors within the GDM. MAG assemblage turnover
is shown above zero and community (ASV or metagenome) turnover is shown below zero. EXTENDED DATA FIG. 9 GENE MAPS OF POLYSACCHARIDE UTILIZATION LOCI (PULS) IDENTIFIED ACROSS _BACTEROIDOTA_
MAGS. Arrows indicate gene directions. Colours represent gene types (SusCD, CAZymes, tRNA genes, other KOs). EXTENDED DATA FIG. 10 PCoA showing the variation in xenobiotics biodegradation
and metabolism among MAG assemblages. SUPPLEMENTARY INFORMATION REPORTING SUMMARY SUPPLEMENTARY TABLES 1–4 Supplementary Table 1: Summary of metagenome information: lake names, accession
information, sampling coordinates and assembly characteristics. Table 2: Summary of MAG quality, characteristics, taxonomy, associated file names, marker gene content and TAD80 across 300
freshwater to oligosaline lakes. Table 3: Number of novel MAGs in each phylum. Table 4: Generalized dissimilarity modelling results for MAG assemblages and community (ASV and metagenome)
profiles across lakes based on taxonomic and functional composition. PEER REVIEW FILE SOURCE DATA SOURCE DATA FIG. 1 Statistical source data. SOURCE DATA FIG. 2 Statistical source data.
SOURCE DATA FIG. 3 Statistical source data. SOURCE DATA FIG. 4 Statistical source data. SOURCE DATA FIG. 5 Statistical source data. RIGHTS AND PERMISSIONS Springer Nature or its licensor
(e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted
manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Garner,
R.E., Kraemer, S.A., Onana, V.E. _et al._ A genome catalogue of lake bacterial diversity and its drivers at continental scale. _Nat Microbiol_ 8, 1920–1934 (2023).
https://doi.org/10.1038/s41564-023-01435-6 Download citation * Received: 01 October 2022 * Accepted: 20 June 2023 * Published: 31 July 2023 * Issue Date: October 2023 * DOI:
https://doi.org/10.1038/s41564-023-01435-6 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a shareable link is not
currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative