Gut microbiome composition reveals the distinctiveness between the bengali people and the indigenous ethnicities in bangladesh

Play all audios:

ABSTRACT Ethnicity has a significant role in shaping the composition of the gut microbiome, which has implications in human physiology. This study intends to investigate the gut microbiome

of Bengali people as well as several indigenous ethnicities (Chakma, Marma, Khyang, and Tripura) residing in the Chittagong Hill Tracts areas of Bangladesh. Following fecal sample collection

from each population, part of the bacterial 16 s rRNA gene was amplified and sequenced using Illumina NovaSeq platform. Our findings indicated that Bangladeshi gut microbiota have a

distinct diversity profile when compared to other countries. We also found out that Bangladeshi indigenous communities had a higher _Firmicutes_ to _Bacteroidetes_ ratio than the Bengali

population. The investigation revealed an unclassified bacterium that was differentially abundant in Bengali samples while the genus _Alistipes_ was found to be prevalent in Chakma samples.

Further research on these bacteria might help understand diseases associated with these populations. Also, the current small sample-sized pilot study hindered the comprehensive understanding

of the gut microbial diversity of the Bangladeshi population and its potential health implications. However, our study will help establish a basic understanding of the gut microbiome of the

Bangladeshi population. SIMILAR CONTENT BEING VIEWED BY OTHERS ELUCIDATING THE GUT MICROBIOME ALTERATIONS OF TRIBAL COMMUNITY OF ARUNACHAL PRADESH: PERSPECTIVES ON THEIR LIFESTYLE OR FOOD

HABITS Article Open access 31 October 2022 COMPARATIVE ANALYSIS OF GUT MICROBIOTA IN FREE RANGE AND HOUSE FED YAKS FROM LINZHOU COUNTY Article Open access 24 April 2025 GUT MICROBIOMES OF

AGROPASTORAL CHILDREN FROM THE ADADLE REGION OF ETHIOPIA REFLECT THEIR UNIQUE DIETARY HABITS Article Open access 01 December 2023 INTRODUCTION The human gut microbiome which is composed of

an enormous collection of microorganisms plays a crucial role in modulating various physiological processes, metabolic diseases, and immunological dysregulation. The composition of the gut

microbiome is influenced by a multitude of factors, including diet, lifestyle, environment, genetics, and ethnicity1. They forge a symbiotic relationship with the host in the

gastrointestinal tract, where they are monitored by the innate immune system using pattern recognition receptors such as toll-like receptors and Nucleotide-binding oligomerization

domain-containing protein (NOD)-like receptors2. A number of these bacteria are involved in regulating host metabolism by producing metabolites such as folate, indoles, secondary bile acids,

trimethylamine-N-oxide, neurotransmitters (e.g., serotonin, gamma amino butyric acid), and short-chain fatty acids3. Alteration in the host–symbiont relationships in the gut has been

observed in diseases like obesity, type 2 diabetes mellitus, non-alcoholic liver disease, cardio-metabolic diseases, and so on4. Since different ethnic groups have been reported to have

variations in genetics, food habits, lifestyle, their alterations can be greatly influenced by ethnicity. As a result, some ethnic groups are predisposed to certain metabolic diseases, while

others are inclined to others5. For example, high-risk variants of _NOD2_ are less prevalent in African Americans which makes their microbiome less prone to Crohn’s Disease6,7. Similarly,

the _lactase_ gene varies in different populations8. In the Japanese population higher abundance of _Bifidobacterium_ has been observed due to their characteristic _LCT_ genotypes9,10.

Hence, gut microbiome studies on ethnic groups can be helpful in identifying prognostic microbial biomarkers for various diseases. The genetic makeup, dietary patterns, and geographic

regions shape the gut microbiome of various ethnicities in a distinct pattern11,12,13,14. Previously, a number of gut microbiome studies in South-East Asia have been conducted. Gut

microbiomes of populations living in India and China which are geographically close to Bangladesh, showed an abundance of _Prevotella_ and _Bacteroidetes_ respectively5. In India, gut

microbiomes from the Adi, Apatani, and Nyshi tribes of Arunachal Pradesh (a state in Northeastern India) demonstrated the predominance of _Firmicutes_, _Bacteroidates_, _Actinobacteria_, and

_Proteobacteria_. On the other hand, Mongoloid and Proto-Australoid tribes of India were dominated by _Prevotella_15,16. However, studies from ethnic groups in Bangladesh are still not

available. Bangladeshi ethnic groups have diverse dietary habits. Bengali food customs stem from their agrarian culture, which is defined by a profusion of rice, fish, and different

vegetables. Dishes like biryani and kebabs are examples of the historical effects of Mughal and British dynasties on the cuisine. Seasonal fruit preparations, street snacks like shingara and

samosas, and traditional cakes called “Pitha” are all common17. Bangladeshi tribal people use forest resources to eat diversely. They feed on ‘bans koral,’ bamboo shoots from _Melocanna

baccifera_ and _Bambusa tulda_ during the rainy season. Fresh or dried wild plants commonly accompany staple grain recipes. Acidic leaves are often eaten as salad or chutney. Woody

perennials including _Albizia procera_, wild mango, and _Daemonorops jenkinsianus_ are vegetables for these ethnic groups. Lentinus, Shizophyllum, and Jew’s Ear mushrooms from decomposing

wood are eaten. Wild banana inflorescences and leaf sheath soft cores are also edible. During food shortages, banana core cooked with rice or bran is crucial18. Bangladesh is a country

inhabited by many different ethnicities. Bengalis are the largest among them. Other major groups are Chakma, Marma, Khyang, and Tripura with distinguishable food habits and lifestyles. The

Chittagong Hill Tracts in the southeastern parts of Bangladesh are home to the Chakma, Marma, Khyang, and Tripura people. Bengali people (Bengali language speakers) have Indian ancestry with

Proto-Australoid, Caucasoid, and Mongoloid genetics19,20,21. On the other hand, Chakma, Marma, Khyang, and Tripura people (Tibeto-Burman language speakers) have predominance of

Tibeto-Burman genetics. Tibeto-Burman populations originated from northwestern China and moved to the South. They have interbred with numerous southern tribes for the past 2600 years,

creating specific genetic traits among southern Tibeto-Burman populations22,23. Hence, Tibeto-Burman speakers from Northeast India are more distinctive than the ones living in the

Himalayas23. In Bangladesh, the Tibeto-Burman populations (e.g., Chakma, Marma, Khyang, Tripura etc.) have higher similarity with Northeast Indian Tibeto-Burman but they contain more

mainland Indian ancestry24. In this study, we investigated the gut microbiomes of Bengali, Chakma, Marma, Khyang, and Tripura populations in order to determine whether Bangladeshi

Tibeto-Burman populations differ from Bengali people in terms of their gut microbiomes. METHODS SAMPLE COLLECTION Prior to sample collection, written informed consent was obtained from all

participants. Ethical approval and necessary permits were obtained from the National Institute of Biotechnology Ethical Review Committee, Bangladesh (NIBERC2022-01). All ethical regulations

relevant to human research participants were followed. Supplementary Data 1 contains detailed sample information regarding age, gender, and cohort. All of the indigenous volunteers (Chakma,

Marma, Khyang, and Tripura) in this study reside in the Rangamati district (22°37’60 N 92°12’0E) of the Chittagong Hill Tracts region. Bengali samples were collected from Dhaka Division

(23.9536° N, 90.1495° E). Based on ethnicity, the participants were divided into five groups (Bengali, Chakma, Marma, Khyang, and Tripura). A total of 55 individuals were sampled, of which

13 were Bengali, 15 were Chakma, 10 were Khyang, 6 were Marma, and 11 were Tripura. Fecal samples of the participants were collected using sterile stool collection tubes. The samples were

then transported to the National Institute of Biotechnology using Icebox and subsequently stored at −80 °C temperature. Fecal DNA extraction was executed using the PureLink™ Microbiome DNA

Purification Kit (Catalog number: A29790). Specialized beads were used along with a combination of heat, chemical, and mechanical disruption to lyse the microorganisms efficiently.

Precipitation with a proprietary cleaning buffer removed inhibitors. After that, the samples were placed in spin columns, and the DNA attached to the column was washed once before elution.

DNA concentration and purity were estimated by Thermo Scientific NanoDrop 2000/2000c and then stored at −20 °C. AMPLICON GENERATION AND LIBRARY PREPARATION FOR SEQUENCING Amplicons were

generated using the 341 F (5’-CCTAYGGGRBGCASCAG-3’) and 806 R (5’-GGACTACNNGGGTATCTAAT-3’) primers that targeted the V3-V4 region of the bacterial 16 S rRNA gene. All PCR reactions were

performed with 15 μL of Phusion® High-Fidelity PCR Master Mix (New England Biolabs), 0.2 μM of forward and reverse primers, and around 10 ng template DNA. For thermal cycling, initial

denaturation at 98 °C for 1 min was followed by 30 cycles of denaturation at 98 °C for 10 s, annealing at 50 °C for 30 s, and elongation at 72 °C for 30 s. Final elongation was carried out

for 5 min at 72 °C. TruSeq® DNA PCR-Free Sample Preparation Kit (Illumina, USA) was used as per the manufacturer’s protocol for sequencing library preparation, and index codes were appended.

The quality of the library was evaluated by employing the [email protected] Fluorometer (Thermo Scientific) and the Agilent Bioanalyzer 2100 system. Finally, the library was sequenced on an

Illumina NovaSeq platform, resulting in 250 bp paired-end reads. DATA PRE-PROCESSING AND QUALITY CONTROL The paired end sequences were converted into QIIME2 format for data pre-processing,

quality control, taxonomic assignment, differential abundance identification, functional analysis using QIIME2 platform (version 2021.4.0)25. More specifically, the data pre-processing of

paired end sequences were performed using the DADA2 plug-in within QIIME226. DADA2 filtered noisy reads, performed error correction in marginal sequences, removed chimeras and singletons,

joined denoised paired-end reads, and also de-replicated the filtered reads. The features produced by DADA2 are denoted as amplicon sequence variants. TAXONOMIC ASSIGNMENT For taxonomic

assignment, a pre-trained classifier based on the Naive Bayes machine-learning model was implemented. This model was trained on 99% sequence similarity of Greengenes 13_8 data. This

classifier was then deployed for taxonomy assignment of amplicon sequence variants. DIVERSITY ANALYSIS Several diversity metrics in QIIME2 require a rooted phylogenetic tree generated from

the amplicon sequence variants of the sampled data. A reference-based fragment insertion method, using q2-fragment-insertion tool, was applied to construct the rooted tree for this purpose.

Greengenes 13_8 data was used as a reference database in the q2-fragment-insertion tool27,28. The sequencing depth of the samples was 3525 to observe the richness. This phenomenon was

checked with the alpha rarefaction curve generated by the q2-diversity tool. The microbiome within and between samples was calculated by the core-metric-phylogenetic method of the

q2-diversity tool. This method computes several alpha (Observed features, Shanon diversity, Faith’s phylogenetic diversity, Pielou evenness) and beta (Jaccard distance, Bray–Curtis distance,

unweighted UniFrac distance, and weighted UniFrac distance) diversity metrics altogether29. Based on each beta diversity metric, this command also performed principal coordinates analysis

(PCoA)30. To visualize the PCoA plots for every beta diversity metric, EMPeror visualization tool was utilized to generate the figures31. Several statistical tests were conducted during

diversity analysis such as Kruskal–Wallis H test, two-way ANOVA, paired _t_-test and PERMANOVA test32,33,34,35. The boxplot() function in R was used to draw boxplots based on alpha diversity

values. This function follows the 1.5 IQR method for detecting outliers which are placed above and below the whiskers on the boxplots. Here, only one sample was identified as outlier

(Sample ID: BBT19). The result including the outlier has been provided in Supplementary Fig. 1. After removal of the outlier, the number of samples used for analysis were reduced from 55 –

54. DIFFERENTIAL ABUNDANCE TEST To classify the features that were differentially abundant across various sample groups the analysis of the composition of microbiomes (ANCOM) method was

applied by the q2-composition tool36. This statistical framework was deployed at the genus level. The minimum sample size for each feature was set to 27 (half of the total samples) because

ANCOM fails to manage false discovery rates at sample sizes <10 as well as to remove very lowly abundant features37. For linear discriminant analysis by the LEfSe, the feature table was

collapsed at the genus level and the minimum sample size was chosen at 2738. This tool first performs the non-parametric Kruskal-Wallis (KW) sum-rank test to identify the features which had

significant differential abundance across different metadata categories. Finally, LEfSe applies linear discriminant analysis to compute the effect size of each differentially abundant

feature and plot the linear discriminant analysis score in the log10 scale. Result of both ANCOM and LEfSe analysis conducted for all samples including the previously mentioned outliers has

been provided in Supplementary Fig. 2. FUNCTIONAL ANALYSIS BURRITO, an interactive visualization web server (http://elbo-spice.cs.tau.ac.il/shiny/burrito/), was utilized to explore the

taxa-function relationship within the samples of the study39. To acquire gene contents and functional annotations, this tool adopts the PICRUSt and KEGG Orthology databases,

respectively40,41. At first, features with a sample size of <27 were filtered from the original feature table to remove very low abundant taxon. The q2-vsearch tool was employed for

closed-reference clustering of retrieved features at 97% identity based on greengenes 97% OTU IDs as reference 42,43. Thus the acquired OTU table was then converted to the appropriate table

format for BURRITO input. COMPARATIVE ANALYSIS WITH TROPICAL AND SUBTROPICAL COUNTRIES Since Bangladesh is a tropical country, we compared the gut microbiome of Bangladeshi samples with

several tropical and subtropical countries to see if there is any similarity between them. To do this we took sequence data from publicly available 16 s rRNA amplicon sequence data from the

NCBI Sequence Read Archive and MG-RAST databases. Only healthy / control samples were taken from the selected datasets. Countries included in the comparative analysis were Australia, Egypt,

India, Indonesia, Malaysia, Mexico, Thailand, and Vietnam. Information about the samples taken from NCBI and MG-RAST has been presented in Supplementary Data 2 and Supplementary Data 3

respectively. The samples downloaded from MG-RAST were processed using q2-deblur while the samples downloaded from NCBI were processed using q2-dada234,44. This is because the samples were

taken from different regions and were prepared and sequenced using different methods. Amplicon sequencing using the Illumina MiSeq technology was done on Indian samples, encompassing the

V3-V4 region of bacterial 16 S rDNA. On the other hand, the V4 region of bacterial 16 S ribosomal RNA genes from Malawi, Amerindians, and the United States was amplified and sequenced using

an Illumina HiSeq 2000. The V1-V3 region of the 16 S ribosomal RNA (rRNA) gene of the Mongolian samples was amplified and sequenced using pyrosequencing on a Roche GS FLX. The samples were

merged with the qiime feature-table merge method. All the samples were rarefied to the same depth (3525). These foreign samples (_n_ = 181) were then compared with all Bangladeshi samples

(_n_ = 55) using the q2-diversity tool via Alpha diversity and Beta Diversity analysis. Afterwards, a phylogenetic rooted tree was generated by the sepp fragment insertion approach. This

rooted phylogeny was used to create a Unweighted Pair Group Method with Arithmetic Mean tree based on the Unweighted UniFrac metric by beta-rarefaction command of the q2-diversity tool45.

REPORTING SUMMARY Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article. RESULTS BENGALI POPULATION POSSESSED LOWER

_FIRMICUTES_ TO _BACTEROIDETES_ RATIO COMPARED TO INDIGENOUS GROUPS Among the 19 classified phyla, _Firmicutes_ and _Bacteroidetes_ were the most prevalent (48% and 34% of the total) in the

gut microbiome of all Bangladeshi populations. Other phyla with higher abundance were Proteobacteria (14%), Actinobacteria (3%), Tenericutes (0.5%), etc (Supplementary Fig. 3).

Interestingly, 100% of taxa from the _Bacteroidetes_ phylum belonged to the _Bacteroidia_ class. On the other hand, considering _Firmicutes_ phylum, 87% of features are _Clostridia_ class,

11% _Bacilli_, and 2% are _Erysipelotrichi_ (Supplementary Fig. 4). In Supplementary Data 4, the taxonomic classification of each feature id is documented, along with the confidence value.

The normalized abundance of the top ten genera across different cohorts is represented in cpm (counts per million) (Fig. 1a). The _Prevotella_ genus abundance was relatively similar in all

cohorts. Nevertheless, the prevalence of the _Bacteroides_ genus was drastically higher in the Chakma population and very low in Marma and Tripura samples. Moreover, Chakma tribal group also

contained _Faecalibacterium_, _Roseburia_, and uncharacterized genera from the _Ruminococcaceae_ family and _Bacteroidales_ order in a relatively higher amount. On the other hand,

_Streptococcus_ and _Megasphaera_ genera were highly abundant in the Bengali and Tripura populations and the Marma cohort contained these genera in relatively lower quantities. The Khyang

group contained the genus _Dialister_ and uncharacterized genera from _Enterobacteriaceae_ and _Ruminococcaceae_ families. About 0.05% amplicon sequence variants were classified as archaea

of the _Methanobacteriaceae_ family, of which only 0.1% were from the _Methanosphaera_ genus, and the rest of the features belonged to the _Methanobrevibacter_ genus (Fig. 1b). Twenty-four

families of features were found as core features and the frequency heatmap of these features revealed that there were two main clusters, one with higher frequency and another with lower

frequency (Fig. 1c). The _Streptococcaceae_, _Prevotellaceae_, _Veillonellaceae_, _Lachnospiraceae_, _Ruminococcaceae_, _Enterobacteriaceae_, and _Bacteroidaceae_ families were in the higher

frequency cluster. Most of the samples from the Bengali population clustered together and contained these higher frequency microbial families in relatively lower amounts. The median value

of the _Firmicutes_ to _Bacteroidetes_ ratio was highest (2.018) in the case of Tripura samples, while the Bengali population had the lowest (0.877) median ratio (Fig. 1d). All other cohorts

(Chakma, Khyang, Tripura) had a median _Firmicutes_ to _Bacteroidetes_ ratio of more than one. THE BENGALI POPULATION SHOWED INFERIOR MICROBIOME RICHNESS THAN OTHER ETHNIC COHORTS Alpha

diversity depicts the diversity within the sample. Alpha diversity is measured via Shannon diversity, Observed features, Faith pd and Pielou evenness parameters. Shannon diversity calculates

both the number of species in a community and their relative abundance. The median of Shannon diversity was relatively lower in the Bengali group compared to others (Fig. 2). For ethnicity

based cohorts, Kruskal-Wallis test for all groups had a _p-_value of 0.0033 (Supplementary Data 5). Observed features counted the number of distinct features present in the cohorts (Fig. 2).

Chakma, Marma, Khyang and Tripura populations had higher species richness than the Bengalis (_p-_value = 0.0003) (Supplementary Data 6). Faith pd incorporated information of the

evolutionary relationships between different bacterial species. Bengali samples had lower Fatih _p_-values than indigenous populations (Fig. 3). However, the difference in Faith pd score was

statistically insignificant (_p_-value = 0.3816) (Supplementary Data 7). Pielou evenness measured how evenly different species’ abundances were distributed within a community (Fig. 3).

There was no significant difference in Pielou evenness profile between indigenous populations (_p-_value = 0.0631) (Supplementary Data 8). PRESENCE AND ABSENCE OF VARIOUS SPECIES MADE

BENGALI MICROBIOME DISTINCT The PCoA plot revealed that the Bengali cohort was distinct from others in terms of Jaccard distance based on ethnicity (Fig. 4b) Jaccard distance is a binary

distance matrix that only considers whether a taxon is present or not. In other metrics, such as Bray–Curtis, weighted and unweighted UniFrac distances calculate presence and relative

abundances of microbial taxa but no separated clusters were observed there (Figs. 4a, c, and d). _ALISTIPES_ GENUS IS DIFFERENTIALLY ABUNDANT IN CHAKMA POPULATION ANCOM analysis revealed the

microbial community compositions between the various Bangladeshi ethnic groups. It used a log-ratio test to compare the relative abundance using non-parametric methods. Here, only the

Bengali vs. Non-Bengali and the Chakma vs. Non-Chakma comparisons demonstrated differentially abundant taxa between the groups. In terms of ANCOM, the other groups (Marma, Khyang, and

Tripura) did not reveal any differentially abundant bacteria. In the Bengali group, an unclassified bacteria showed a negative centered log ratio (clr) with higher W score (Fig. 5a).

Whereas, the Chakma population depicted abundance for _Alistipes_ and _Odoribacter_ (Fig. 5b). Another tool, LEfSe implemented statistical methods to identify microbial features that were

differentially abundant. LEfSe unveiled differentially abundant genus _Paraprevotella_, _Lactonifactor, Barnesiella, Bacteroides_, _Alistipes_, and _Ruminococcus_ within the Chakma

population (Fig. 5c). The results from ANCOM and LEfSe analysis complemented each other as they both unveiled that the genus _Alistipes_ is differentially abundant in the Chakma population.

BENGALI GUT MICROBIOME CONTRIBUTED TO MARKEDLY DIVERSE BIOLOGICAL PATHWAYS Relationships between microbiome composition and its effect on different biological functions were also explored in

this study. Among all metadata categories, only the Bengali, Chakma, and Marma groups showed differential enrichment of function based on their gut microbiome composition (Fig. 6). All

differentially abundant functions along with related contributing taxa and BH (Benjamini and Hochberg) FDR-adjusted _p_-values are tabulated in Supplementary Data 9. Thirty-three pathways

were differentially abundant in Bengali samples. Several of them were highly enriched in Bengali samples such as peptidases, Ribosomes, Purine, and Pyrimidine metabolism (Fig. 6a). The genus

_Bacteroides_, _Blautia_, _Collinsella_, _Coprococcus_, _Dorea_, _Parabacteroides_, SMB53, and _Slackia_ were the top significantly contributing taxa in the enrichment of these functions.

Chakma samples seemed to be enriched with histidine metabolism, arginine biosynthesis, and transcription machinery (Fig. 6b). But, in the case of the Marma samples histidine metabolism,

arginine biosynthesis functions were in relatively lower abundance than other Non-Marma samples, and transcription machinery function was in higher frequency (Fig. 6c). For both categories,

the genus _Bacteroides_, _Blautia_, and _Parabacteroides_ were the most contributing taxa for function abundance. Detailed statistics for significant pathways and taxa contributions for each

pathway are documented in Supplementary Data 10. BANGLADESHI GUT MICROBIOME HAS A DISTINCT COMPOSITION COMPARED TO TROPICAL AND SUBTROPICAL POPULATIONS When compared with the gut microbiome

of several other countries, the Bangladeshi samples formed a vividly distinct cluster based on bacterial composition and abundance (Fig. 7a, b). Cluster for Bray–Curtis and Jaccard distance

indicates that bacterial composition and their abundances are unique for Bangladeshi populations. Their species richness was higher than other countries (Fig. 7f). The unweighted UniFrac,

which incorporates phylogenetic information, demonstrated that most of the samples from the Bangladeshi population clustered closer to the Indian samples (Fig. 7c). All numerical source data

of Fig. 7 can be found in Supplementary Data 11a–h. Furthermore, the Unweighted Pair Group Method with Arithmetic Mean based tree showed that most of the indigenous samples of Bangladesh

have clustered with different Indian indigenous groups while the Bangladeshi samples as a whole were scattered across different phylogenetic clades (Fig. 8). DISCUSSION The human gut

microbiome is an integral part of human growth and development and it is highly influenced by ethnicity14,46. Diverse ethnic populations reside in Bangladesh. The majority of the population

is Bengali whereas large indigenous communities such as the Chakma, Marma, Tripura, and Khyang live in Chittagong Hill Tracts region of the country. The eating habits of these communities

substantially differ from Bengali people. Their gut microbiomes, which may have an impact on important public health issues, can be influenced by their genetic make-up, geographic location,

and lifestyle. In this study, we conducted a pilot survey to shed light of these areas. Here, we have performed a 16 s rRNA gene amplicon sequencing to identify the gut microbiome

composition. To execute this, we have collected fecal samples from Bengali, Chakma, Marma, Khyang, and Tripura populations. After collecting the microbial DNA from fecal samples, parts of

the 16 S rRNA genes of the bacterial species were amplified. The amplified products were then sequenced. The sequence data was then processed and analyzed to draw interpretations. After all

the analysis, we found that at the phylum level, Bengali population has lower _Firmicutes_ to _Bacteroidetes_ ratio compared to indigenous groups. Among _Firmicutes_ phylum, _Dialister_ and

_Faecalibacterium_ are highly abundant in Bangladeshi microbiomes (Fig. 1a). On the other hand, _Prevotella_, a member of the _Bacteroidetes_ phylum, was distributed evenly across each

cohort. People from the Indian subcontinent usually have _Prevotella_ abundance in their gut47. Here, we have explored the alpha and beta diversity of the gut microbiome. Alpha diversity

depicted within-group diversity and beta diversity showed different diversity ratios between the groups. The alpha diversity of the Bengali population was lower compared to others according

to the Shannon diversity and species richness (Observed features) (Fig. 2). The overall species evenness was enriched for indigenous populations (Fig. 3). Based on the phylogenetic diversity

of the gut microbes, the Bengali and the people of the Chittagong Hill Tracts were found not to be very distant. However, our findings indicate that the overall gut microbial population and

abundance is unique for Bangladesh when compared with international datasets. However, most of the indigenous samples from Bangladesh (Chakma, Marma, Khyang, Tripura) clustered separately

with a common origin with the Indian indigenous population whereas Bengali samples were spread across various clades (Fig. 8). Some bacterial species were found to be abundant in indigenous

groups that were not enriched in Bengali samples or vice versa. To identify this differential abundance, we employed several statistical approaches using LEfSe and ANCOM. Both of these tools

identified an unclassified bacteria, which can be studied further, that is significantly abundant in Bengali samples whereas Chakma samples had enriched levels of _Alistipes_ species.

_Alistipes_ is a newly discovered genus under _Bacteroidetes_ phylum. The gut microbial pathways play a critical role in health and diseases48. In Bengali samples DNA repair system,

pyrimidine and purine metabolism, lipopolysaccharide (LPS) biosynthesis, peptidases, ribosome, etc functions were enriched. On the other hand, Chakma samples were enriched with histidine

metabolism and arginine biosynthesis (Fig. 6b) whereas in the Marma group abundance of these pathways were lower (Fig. 6c). Furthermore, Bangladeshi gut microbes possessed a distinctive

diversity profile. Further research is needed to investigate the specific differences in responses and underlying mechanisms associated with distinct gut microbiota profiles. LIMITATIONS OF

THE STUDY The primary limitation of the study is small sample size, which exhibits significant variability among individuals sampled. With a sample size of _n_ = 55 divided into five ethnic

groups, two genders, and three large age groups (20–40, 40–60, and 60–80 years), the representativeness for each combination is low. The gut microbiome (GM) is well-known to be highly

influenced by factors such as sex, age, current health status and so on. In our study, the representativeness of each of these groups is limited. A higher sample size would certainly enhance

the credibility of the differences between these groups and also help draw more meaningful conclusions. However, the primary goal of our study was to explore and characterize the gut

microbiome of Bangladeshi population for the first time and thus establish the baseline data for further gut microbiome research in Bangladesh. During the course of the study, as a secondary

goal, we also wanted to see if there is any difference in the gut microbiome of the various ethnicities. Since our main priority was to characterize the Bangladeshi population as a whole,

the difference between various subgroups based on age, sex, and health status was given less importance. We want to reiterate that the number of participants in the current study was not

sufficient to provide a comprehensive representation of the overall gut microbial diversity within neither the Bangladeshi population as a whole nor the ethnicities studied (Bengali, Chakma,

Marma, Khyang, and Tripura). As a result, the findings from this study may lack statistical power and robustness, leading to potential biases and limited generalizability of the results.

Moreover, the small sample size may hinder the identification of subtle microbial variations that could be crucial in understanding complex interactions between the gut microbiota and

various health conditions. Consequently, caution must be exercised when interpreting the results of this gut microbiome study due to its limited sample size. Further research with larger

cohorts is absolutely necessary to draw more definitive and reliable conclusions about the role of gut microbiota in the Bangladeshi population. In conclusion, the current study indicates

that the indigenous gut microbiome was more diverse and distinct from the Bengali population. Our study will help establish the baseline data for the gut microbiome of the Bangladeshi

population. DATA AVAILABILITY All newly generated 16rRNA amplicon sequencing data used in this study can be freely accessed via NCBI BioProject number PRJNA876782. Source data have been

included in the supplementary data files. All other data are available from the corresponding author on reasonable request. CODE AVAILABILITY Code can be accessed from the corresponding

author on reasonable request. Versions of software used to process the current dataset: QIIME2 (version 2021.4.0). REFERENCES * Bull, M. J. & Plummer, N. T. Part 1: The human gut

microbiome in health and disease. _Integr. Med. Clin. J._ 13, 17–22 (2014). Google Scholar * Chassaing, B., Kumar, M., Baker, M. T., Singh, V. & Vijay-Kumar, M. Mammalian gut immunity.

_Biomed. J._ 37, 246–258 (2014). Article PubMed Google Scholar * Cani, P. D. Human gut microbiome: hopes, threats and promises. _Gut_ 67, 1716–1725 (2018). Article CAS PubMed Google

Scholar * Fan, Y. & Pedersen, O. Gut microbiota in human metabolic health and disease. _Nat. Rev. Microbiol._ 19, 55–71 (2021). Article CAS PubMed Google Scholar * Gupta, V. K.,

Paul, S. & Dutta, C. Geography, ethnicity or subsistence-specific variations in human microbiome composition and diversity. _Front. Microbiol._ 8, 1162 (2017). Article PubMed PubMed

Central Google Scholar * Adeyanju, O. et al. Common NOD2 risk variants in African Americans with Crohn’s disease are due exclusively to recent caucasian admixture. _Inflamm. Bowel Dis._

18, 2357–2359 (2012). Article PubMed Google Scholar * Lauro, M. L., Burch, J. M. & Grimes, C. L. The effect of NOD2 on the microbiota in Crohn’s disease. _Curr. Opin. Biotechnol._ 40,

97–102 (2016). Article CAS PubMed PubMed Central Google Scholar * Charati, H. et al. The evolutionary genetics of lactase persistence in seven ethnic groups across the Iranian plateau.

_Hum. Genomics_ 13, 7 (2019). Article CAS PubMed PubMed Central Google Scholar * Hall, A. B., Tolonen, A. C. & Xavier, R. J. Human genetic variation and the gut microbiome in

disease. _Nat. Rev. Genet._ 18, 690–699 (2017). Article CAS PubMed Google Scholar * Kato, K. et al. Association between functional lactase variants and a high abundance of

bifidobacterium in the gut of healthy Japanese people. _PloS One_ 13, e0206189 (2018). Article PubMed PubMed Central Google Scholar * Huang, T., Shu, Y. & Cai, Y.-D. Genetic

differences among ethnic groups. _BMC Genomics_ 16, 1093 (2015). Article PubMed PubMed Central Google Scholar * Bennett, G., Bardon, L. A. & Gibney, E. R. A comparison of dietary

patterns and factors influencing food choice among ethnic groups living in one locality: a systematic review. _Nutrients_ 14, 941 (2022). Article PubMed PubMed Central Google Scholar *

Manica, A., Prugnolle, F. & Balloux, F. Geography is a better determinant of human genetic differentiation than ethnicity. _Hum. Genet._ 118, 366–371 (2005). Article PubMed PubMed

Central Google Scholar * Dwiyanto, J. et al. Ethnicity influences the gut microbiota of individuals sharing a geographical location: a cross-sectional study from a middle-income country.

_Sci. Rep._ 11, 2618 (2021). Article CAS PubMed PubMed Central Google Scholar * Dehingia, M. et al. Gut bacterial diversity of the tribes of India and comparison with the worldwide

data. _Sci. Rep._ 5, 18563 (2015). Article CAS PubMed PubMed Central Google Scholar * Hazarika, P., Chattopadhyay, I., Umpo, M., Choudhury, Y. & Sharma, I. Elucidating the gut

microbiome alterations of tribal community of Arunachal Pradesh: perspectives on their lifestyle or food habits. _Sci. Rep._ 12, 18296 (2022). Article CAS PubMed PubMed Central Google

Scholar * Alam, S. M. N. & Naser, M. N. Chapter 2—Role of traditional foods of Bangladesh in reaching-out of nutrition. In _Nutritional and Health Aspects of Food in South Asian

Countries_ (eds. Prakash, J., Waisundara, V. & Prakash, V.) 217–235 (Academic Press, 2020) https://doi.org/10.1016/B978-0-12-820011-7.00025-3. * Haque, M. M. Dietary practice among

mainstream Bengali population and ethnic communities in Bangladesh. _Arch. Nutr. Food Sci_. 1, 5–7 (2020). * Chakraborty, R. et al. Gene differentiation among ten endogamous groups of West

Bengal, India. _Am. J. Phys. Anthropol._ 71, 295–309 (1986). Article CAS PubMed Google Scholar * Saha, N. Blood genetic markers in Bengali muslims of Bangladesh. _Hum. Hered._ 37, 86–93

(1987). Article CAS PubMed Google Scholar * Hasan, M. M. et al. Phylogenetic and forensic studies of the Bangladeshi population using next-generation powerPlex® Y23 STR marker system.

_Int. J. Leg. Med._ 130, 1493–1495 (2016). Article Google Scholar * Wen, B. et al. Analyses of genetic structure of Tibeto-Burman populations reveals sex-biased admixture in Southern

Tibeto-Burmans. _Am. J. Hum. Genet._ 74, 856–865 (2004). Article CAS PubMed PubMed Central Google Scholar * Gayden, T. et al. Genetic insights into the origins of Tibeto-Burman

populations in the himalayas. _J. Hum. Genet._ 54, 216–223 (2009). Article CAS PubMed Google Scholar * Gazi, N. N. et al. Genetic structure of Tibeto-Burman populations of Bangladesh:

evaluating the gene flow along the sides of Bay-of-Bengal. _PLoS One_ 8, e75064 (2013). Article CAS PubMed PubMed Central Google Scholar * Bolyen, E. et al. Reproducible, interactive,

scalable and extensible microbiome data science using QIIME 2. _Nat. Biotechnol._ 37, 852–857 (2019). Article CAS PubMed PubMed Central Google Scholar * Callahan, B. J. et al. DADA2:

high-resolution sample inference from Illumina amplicon data. _Nat. Methods_ 13, 581–583 (2016). Article CAS PubMed PubMed Central Google Scholar * Janssen, S. et al. Phylogenetic

placement of exact amplicon sequences improves associations with clinical information. _mSystems_ 3, e00021–18 (2018). Article CAS PubMed PubMed Central Google Scholar * Matsen, F. A.,

Hoffman, N. G., Gallagher, A. & Stamatakis, A. A format for phylogenetic placements. _PLoS One_7, e31009 (2012). Article CAS PubMed PubMed Central Google Scholar * Weiss, S. et al.

Normalization and microbial differential abundance strategies depend upon data characteristics. _Microbiome_ 5, 27 (2017). Article PubMed PubMed Central Google Scholar * Halko, N.,

Martinsson, P.-G., Shkolnisky, Y. & Tygert, M. An algorithm for the principal component analysis of large data sets. _SIAM J. Sci. Comput._ 33, 2580–2594 (2011). Article Google Scholar

* Vázquez-Baeza, Y., Pirrung, M., Gonzalez, A. & Knight, R. EMPeror: a tool for visualizing high-throughput microbial community data. _GigaScience_ 2, 16 (2013). Article PubMed

PubMed Central Google Scholar * Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. _J. Am. Stat. Assoc._ 47, 583–621 (1952). Article Google Scholar *

McKinney, W. Data Structures for Statistical Computing in Python. _Proc. 9th Python Sci. Conf._ 56–61 (2010). * Bokulich, N. A. et al. q2-longitudinal: longitudinal and paired-sample

analyses of microbiome data. _mSystems_ 3, e00219–18 (2018). Article PubMed PubMed Central Google Scholar * Anderson, M. J. A new method for non-parametric multivariate analysis of

variance. _Austral Ecol._ 26, 32–46 (2001). Google Scholar * Mandal, S. et al. Analysis of composition of microbiomes: a novel method for studying microbial composition. _Microb. Ecol.

Health Dis._ 26, 27663 (2015). PubMed Google Scholar * Lin, H. & Peddada, S. D. Analysis of microbial compositions: a review of normalization and differential abundance analysis. _NPJ

Biofilms. Microbiomes._ 6, 1–13 (2020). Article Google Scholar * Segata, N. et al. Metagenomic biomarker discovery and explanation. _Genome Biol._ 12, R60 (2011). Article PubMed PubMed

Central Google Scholar * McNally, C. P., Eng, A., Noecker, C., Gagne-Maynard, W. C. & Borenstein, E. BURRITO: An interactive multi-omic tool for visualizing taxa–function relationships

in microbiome data. _Front. Microbiol._ 9, 365 (2018). Article PubMed PubMed Central Google Scholar * Langille, M. G. I. et al. Predictive functional profiling of microbial communities

using 16S rRNA marker gene sequences. _Nat. Biotechnol._ 31, 814–821 (2013). Article CAS PubMed PubMed Central Google Scholar * Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M.

& Tanabe, M. KEGG as a reference resource for gene and protein annotation. _Nucleic Acids Res._ 44, D457–D462 (2016). Article CAS PubMed Google Scholar * DeSantis, T. Z. et al.

Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. _Appl. Environ. Microbiol._ 72, 5069–5072 (2006). Article CAS PubMed PubMed Central Google Scholar

* Rognes, T., Flouri, T., Nichols, B., Quince, C. & Mahé, F. VSEARCH: a versatile open source tool for metagenomics. _PeerJ_ 4, e2584 (2016). Article PubMed PubMed Central Google

Scholar * Amir, A. et al. Deblur rapidly resolves single-nucleotide community sequence patterns. _mSystems_ 2, e00191–16 (2017). Article PubMed PubMed Central Google Scholar * Letunic,

I. & Bork, P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. _Nucleic Acids Res._ 44, W242–W245 (2016). Article CAS

PubMed PubMed Central Google Scholar * Guinane, C. M. & Cotter, P. D. Role of the gut microbiota in health and chronic gastrointestinal disease: understanding a hidden metabolic

organ. _Ther. Adv. Gastroenterol._ 6, 295–308 (2013). Article Google Scholar * Prasoodanan, P. K. V. et al. Western and non-western gut microbiomes reveal new roles of prevotella in

carbohydrate metabolism and mouth–gut axis. _NPJ Biofilms. Microbiomes._ 7, 1–17 (2021). Article Google Scholar * Hou, K. et al. Microbiota in health and diseases. _Signal Transduct.

Target. Ther._ 7, 1–28 (2022). Google Scholar Download references ACKNOWLEDGEMENTS This study was funded under the “Establishment of National Gene Bank Project”, National Institute of

Biotechnology (NIB), Ministry of Science and Technology, Government of the People’s Republic of Bangladesh. We would like to thank Md. Amjad Hossain, Ritaren Chakma, Tina Tripura, Barshan

Chakma, and Suborna Dash for their assistance with sample and metadata collection. AUTHOR INFORMATION Author notes * These authors contributed equally: Ishtiaque Ahammad, Arittra

Bhattacharjee, Zeshan Mahmud Chowdhury, Anisur Rahman, and Mohammad Uzzal Hossain. AUTHORS AND AFFILIATIONS * Bioinformatics Division, National Institute of Biotechnology, Ganakbari,

Ashulia, Savar, Dhaka, 1349, Bangladesh Ishtiaque Ahammad, Arittra Bhattacharjee, Zeshan Mahmud Chowdhury, Anisur Rahman & Mohammad Uzzal Hossain * Rangamati Medical College, Hospital

Road, Rangamati-4500, Rangamati, Bangladesh Gourab Dewan & Shiny Talukder * Molecular Biotechnology Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka, 1349,

Bangladesh Keshob Chandra Das & Md Salimullah * Department of Biochemistry and Microbiology, North South University, Bashundhara, Dhaka, 1229, Bangladesh Chaman Ara Keya Authors *

Ishtiaque Ahammad View author publications You can also search for this author inPubMed Google Scholar * Arittra Bhattacharjee View author publications You can also search for this author

inPubMed Google Scholar * Zeshan Mahmud Chowdhury View author publications You can also search for this author inPubMed Google Scholar * Anisur Rahman View author publications You can also

search for this author inPubMed Google Scholar * Mohammad Uzzal Hossain View author publications You can also search for this author inPubMed Google Scholar * Gourab Dewan View author

publications You can also search for this author inPubMed Google Scholar * Shiny Talukder View author publications You can also search for this author inPubMed Google Scholar * Keshob

Chandra Das View author publications You can also search for this author inPubMed Google Scholar * Chaman Ara Keya View author publications You can also search for this author inPubMed

Google Scholar * Md Salimullah View author publications You can also search for this author inPubMed Google Scholar CONTRIBUTIONS Conceptualization: Mohammad Uzzal Hossain, Keshob Chandra

Das, Chaman Ara Keya, Md Salimullah, Analysis: Ishtiaque Ahammad, Arittra Bhattacharjee, Zeshan Mahmud Chowdhury, Anisur Rahman, Resources: Gourab Dewan, Shiny Talukder, Keshob Chandra Das,

Md Salimullah. Writing—Original Draft: Ishtiaque Ahammad, Arittra Bhattacharjee, Zeshan Mahmud Chowdhury, Anisur Rahman, Writing—Review & Editing: Ishtiaque Ahammad, Arittra

Bhattacharjee, Zeshan Mahmud Chowdhury, Anisur Rahman, Mohammad Uzzal Hossain, Chaman Ara Keya, Md Salimullah. Supervision: Keshob Chandra Das, Chaman Ara Keya, Md Salimullah. Funding

acquisition: Md Salimullah. CORRESPONDING AUTHOR Correspondence to Md Salimullah. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests. PEER REVIEW PEER REVIEW

INFORMATION _Communications Biology_ thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Kevin Theis and Tobias Goris. ADDITIONAL

INFORMATION PUBLISHER’S NOTE Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY

INFORMATION DESCRIPTION OF SUPPLEMENTARY MATERIALS SUPPLEMENTARY DATA 1-11 REPORTING SUMMARY RIGHTS AND PERMISSIONS OPEN ACCESS This article is licensed under a Creative Commons Attribution

4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and

the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s

Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not

permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Ahammad, I., Bhattacharjee, A., Chowdhury, Z.M. _et al._ Gut microbiome composition

reveals the distinctiveness between the Bengali people and the Indigenous ethnicities in Bangladesh. _Commun Biol_ 7, 500 (2024). https://doi.org/10.1038/s42003-024-06191-9 Download

citation * Received: 30 September 2022 * Accepted: 15 April 2024 * Published: 25 April 2024 * DOI: https://doi.org/10.1038/s42003-024-06191-9 SHARE THIS ARTICLE Anyone you share the

following link with will be able to read this content: Get shareable link Sorry, a shareable link is not currently available for this article. Copy to clipboard Provided by the Springer

Nature SharedIt content-sharing initiative