Chromosome-level genome assembly of ridgetail white shrimp exopalaemon carinicauda

Play all audios:

ABSTRACT _Exopalaemon carinicauda_, a eurythermal and euryhaline shrimp, contributes one third of the total biomass production of polyculture ponds in eastern China and is considered as a

potential ideal experimental animal for research on crustaceans. We conducted a high-quality chromosome-level genome assembly of _E. carinicauda_ combining PacBio HiFi and Hi-C sequencing

data. The total assembly size was 5.86 Gb, with a contig N50 of 235.52 kb and a scaffold N50 of 138.24 Mb. Approximately 95.29% of the assembled sequences were anchored onto 45

pseudochromosomes. BUSCO analysis revealed that 92.89% of 1,013 single-copy genes were highly conserved orthologs. A total of 44, 288 protein-coding genes were predicted, of which 70.53%

were functionally annotated. Given its high heterozygosity (2.62%) and large proportion of repeat sequences (71.49%), it is one of the most complex genome assemblies. This chromosome-scale

genome will be a valuable resource for future molecular breeding and functional genomics research on _E. carinicauda_. SIMILAR CONTENT BEING VIEWED BY OTHERS A CHROMOSOME-LEVEL GENOME

ASSEMBLY AND EVOLUTIONARY ANALYSIS OF _COREGONUS USSURIENSIS_ BERG Article Open access 18 July 2024 CHROMOSOME-SCALE ASSEMBLY OF _ARTEMIA TIBETIANA_ GENOME, FIRST AQUATIC INVERTEBRATE GENOME

FROM TIBET PLATEAU Article Open access 12 May 2025 CHROMOSOME-LEVEL ASSEMBLY OF _TRIPLOPHYSA YARKANDENSIS_ GENOME BASED ON THE SINGLE MOLECULE REAL-TIME SEQUENCING Article Open access 05

January 2024 BACKGROUND & SUMMARY The family Palaemonidae, including more than 1400 species in 181 genera, represents the largest family of the order Decapoda1. Animals from this family

are found in marine and freshwater environments in tropical to temperate regions worldwide. It includes several shrimps with high economic value, such as _Macrobrachium rosenbergii_,

_Macrobrachium nipponense_ and _Exopalaemon carinicauda_. The ridgetail white shrimp _E. carinicauda_ is a eurythermal and euryhaline shrimp distributed over a wide geographical area

throughout tropical, subtropical, and temperate coastal waters2,3. It can survive in a multitude of environmental extremes, has a broad salinity tolerance of 2–44 and can survive in

freshwater after domestication4. It is also capable of inhabiting temperatures as low as −3 °C and as high as 39 °C5,6. As one of the most commercially valuable pond-raised species of

shrimp, _E. carinicauda_ contributes to one third of the total production of polyculture ponds in eastern China7. In addition to its important economic value in aquaculture, it is considered

a potential ideal experimental animal for research on crustaceans for its moderate size, transparent body (Fig. 1), short reproductive cycle, large eggs (diameters ranging 0.57–1.08 mm) and

ease of culturing and breeding in captive conditions8. Currently, CRISPR/Cas9-mediated genome editing technology has been successfully used in _E. carinicauda_, which is the first time that

gene editing has been realized in a decapod crustacean9,10. However, the absence of genomic data limits the further application of gene editing in studying the molecular biology,

cytobiology and genetics of crustaceans. Therefore, a high-quality reference genome is essential for understanding the molecular biology, genetics, breeding, ecology and adaptation of _E.

carinicauda_. A fragmented draft genome of _E. carinicauda_ has been assembled using Illumina short reads containing 13,897,062 scaffolds (contig N50, 263 bp)11. Genome survey analysis

indicated that _E. carinicauda_ has a relatively large genome size of 5.73 Gb, which is at least twice as large as that of many decapod shrimps12,13,14. In this study, an improved

chromosome-level genome of _E. carinicauda_ was assembled using the PacBio sequencing platform, Illumina paired-end sequencing, and high-throughput chromatin conformation capture (Hi-C)

technology. Our previous studies suggested that the _E. carinicauda_ karyotype is 2n = 9015, similar to that of other _Exopalaemon_ species16. The final genome size was 5.86 Gb with a contig

N50 length of 235.52 kb and a scaffold N50 length of 138.24 Mb. A total of 44,288 protein-coding genes were predicted in the genome of _E. carinicauda_. This chromosome-level genome

assembly of _E. carinicauda_ provides a valuable genomic resource for further genetic improvement and understanding of the functional genes and molecular mechanisms of _E. carinicauda_.

METHODS ANIMAL MATERIALS AND GENOME SEQUENCING A female shrimp was collected from Rizhao Haichen Aquatic Co., Ltd. The muscle tissue was collected for DNA extraction and library

construction. Total genomic DNA was extracted using a cetyltrimethylammonium bromide method. For the genome survey, a 350 bp paired-end library was constructed according to the

manufacturer’s instructions (Illumina, San Diego, CA, USA) and sequenced on an Illumina NovaSeq 6000 platform. A total of 276.18 Gb of raw data were obtained, which covered approximately 54

× of the estimated genome (Table 1). For PacBio sequencing, a 15 kb library was constructed using the SMRTbell Express Template Prep Kit 2.0 (Pacific Biosciences, Menlo Park, CA, USA) and

sequenced with circular consensus sequencing mode using a single 8 M SMRT Cell on the PacBio Sequel II platform (Pacific Biosciences). After filtering out the low-quality reads and sequence

adapters, 3636.91 Gb subreads of PacBio Data were obtained, representing approximately 708 × sequence coverage based on the estimated genome size (Table 1). Finally, 203.27 Gb of CCS reads

were generated using SMRTLink 9.0 which covered approximately 40 × of the estimated genome. For the construction of the Hi-C library, DNA was fixed with 4% formaldehyde solution and digested

with the 4-cutter restriction enzyme MboI. The digested fragments were labeled with biotin-14-dCTP, then the cross-linked fragments were subjected to blunt-end ligation. The library was

sequenced on the Illumina NovaSeq 6000 platform, and approximately 552.65 Gb of Hi-C clean reads were generated, covering approximately 108 × of the estimated genome (Table 1). GENOME SURVEY

The genome size and heterozygosity were estimated using the k-mer method before genome assembly17. The k-mer distribution was calculated from Illumina short reads using Jellyfish based on

k-mer (k = 17)18. The heterozygosity ratio was estimated by the online tool of GenomeScope19 (https://github.com/schatzlab/genomescope). Finally, the estimated genome size of _E.

carinicauda_ was predicted to be approximately 5.12 Gb, with 84.74% repetitive sequences, and the genome heterozygosity was 2.62% using a 17-mer analysis (Fig. 2), suggesting a complex

genome of _E. carinicauda_. CHROMOSOME-LEVEL GENOME ASSEMBLY The initial genome was assembled with HiFi reads using the Peregrine (v0.1.6.1) (https://github.com/cschin/peregrine). A modified

“best overlap graph” strategy was used to get the contig assembly based on the overlap graph. Contig overlaps were removed from the assembled contig sequences using Purge_dups

(https://github.com/dfguan/purge_dups). _De novo_ assembly of PacBio sequences yielded a preliminary assembly of 5.86 Gb, containing 47,421 contigs with a contig N50 length of 235.28 kb, a

maximum length of 3,038,493 bp and a GC content of 34.79% (Table 1). Chromosome-level assembly of _E. carinicauda_ was conducted using Hi-C technology. Juicer (v1.6.2)20 and 3D-DNA

(v180922)21 software were implemented to obtain the chromosome-level whole genome assembly. The filtered Hi-C reads were aligned to the initial draft genome using Juicer (v1.6.2). Only

uniquely mapped and valid paired-end reads were used for the assembly using 3D-DNA. Juicebox (v1.9.8) was used to manually order the scaffolds to generate more precise chromosome-level

genome of _E. carinicauda_ according to the chromosomal interaction heatmap22. Contact maps were visualized using HiCExplorer (v3.3)23. The number of chromosomes was 90, which was determined

based on karyological observations of _E. carinicauda_ chromosomes in our previous study15. The contigs were ultimately clustered into 45 pseudochromosomes for _E. carinicauda_, with a

scaffold N50 length of 138.24 Mb. The total length of the 45 pseudochromosomes was 5.58 Gb (covered 95.29%) (Fig. 3a,b), of which the length ranged from 46.25 Mb to 338.48 Mb. The length of

the un-placed scaffolds was 275.86 Mb (Table 2). The quality of the final chromosome-level genome assembly was assessed using the following three methods. First, we aligned the Illumina DNA

short reads obtained from our previous study to the assembled genome and found that approximately 99.00% of the DNA short reads could be mapped to our assembly using BWA (v0.7.15)24. Second,

read depth and GC content with 10 kb windows were used to evaluate the assembly results and determine whether there was a significant GC bias or sample contamination, showing that the

assembled genome was clean without contamination (Fig. 4). Finally, genome assembly and completeness were further evaluated using conserved genes in benchmarking universal single-copy

orthologs (BUSCO, v5.2.2) with the arthropoda_odb10 database25. The results showed that 92.89% of the 1013 single-copy genes were highly conserved orthologs (88.75% complete, 4.15%

fragmented, and 7.11% missing) (Table 3). Compared to the published genome of _E. carinicauda_11, our assembled genome is of significantly improved quality and integrity. The contig N50

increased from 263 bp to 235,277 bp, with an increase of nearly 900-fold, and scaffold N50 increased from 816 bp to 138,242,434 bp. Meanwhile, the assembled complete orthologue proportion

enhanced from 43.44% to 88.75% according to the BUSCO assessment. REPETITIVE AND NON-CODING GENE PREDICTION To detect repeat elements in _E. carinicauda_ genome, _de novo_ and homology-based

strategies were combined using multiple methods. Mini-inverted repeat transposable elements (MITEs) were identified using MITE-Hunter (v1.0)26 for _de novo_ annotations. Long terminal

repeat sequences (LTRs) were detected using LTRharvest27 and LTR_Finder (v1.07)28, and the prediction results of these two software programs were integrated using LTR_retriever (v2.8.2)29.

RepeatMasker (v4.1.0)30 was used in the homology-based alignment to search _E. carinicauda_ genome sequence in the RepBase database (http://www.girinst.org/repbase). RepeatMasker was used to

mask the repetitive sequences obtained by the above method, and RepeatModeler (v2.0)31 was used to perform the _de novo_ identification of other repetitive sequences with the repeat-masked

genome. Ultimately, we identified approximately 4.19 Gb of repetitive sequences, accounting for approximately 71.49% of the assembled genome, among which 9.97% were tandem repeat sequences.

Among these repetitive sequences, LTRs (42.52%) accounted for the highest proportion of the assembly, followed by DNA (10.81%) and LINE (3.33%) (Table 4). Five types of noncoding RNA (ncRNA)

were identified in the genome of _E. carinicauda_, including microRNAs (miRNAs), transfer RNAs (tRNAs), ribosomal RNAs (rRNA), small nuclear RNAs (snRNAs) and small nucleolar RNAs

(snoRNAs). The tRNA was predicted using tRNAscan-SE (v2.0)32. Other types of ncRNAs were detected by alignment to Rfam database33 using infernal (v1.1.3) software34. In total, 10249

non-coding RNAs (ncRNAs) were annotated, including 3,702 rRNAs, 386 miRNAs, 5,811 tRNAs, 269 snRNAs, and 81 snoRNAs (Table 5). GENE PREDICTION AND ANNOTATION We detected the protein-coding

genes in the _E. carinicauda genome_ assembly by a comprehensive strategy that combined _ab initio_ prediction, protein-based homology searches, and RNA sequencing data predictions. For _ab

initio_ prediction, augustus (v3.2.2)35, SNAP (v6.0)36, Glimmer hmm (v3.0.4)37 and GeneMark-ET38 were used to predict the repeat-masked genome structure. For protein-based homology

prediction, the protein sequences of homologous species including _Daphnia pulex_ (GCA_021134715.1), _Procambarus virginalis_ (GCA_020271785.1), _Fenneropenaeus chinensis_ (GCA_019202785.2),

_Penaeus japonicus_ (GCA_017312705.1), _Penaeus monodon_ (GCA_015228065.1), _Litopenaeus vannamei_ (GCA_003789085.1), _Portunus trituberculatus_ (GCA_017591435.1) and _M. nipponense_

(GCA_015104395.1) were downloaded from the NCBI database and aligned against the _E. carinicauda_ genome using GeMoMa (v1.7.1)39 to perform homology prediction. Furthermore, the RNA-seq data

from different tissues and embryonic development stages (PRJNA594425, PRJNA746617, PRJNA756619, PRJNA881755, and PRJNA881756) were mapped to the genome by HISAT2 (v2.1.0)40. The full-length

transcripts (PRJNA594425) from our previous study41 were assembled using Cufflinks (v2.1.1)42, then the open reading frame was predicted using PASA (v20140417)43. The EVidenceModeler44 was

employed to consolidate the results from these three methods, enabling the merging and integration of gene predictions. Finally, 44,288 high-quality protein-coding genes were predicted.

These predicted genes displayed an average gene length of 28,448 bp, an average coding length of 1,424 bp and 6.09 coding exons per gene. These genes were functionally annotated using BLAST

against NR, SwissProt, eggNOG, InterPro, GO and KEGG45. The protein-coding gene functional annotation results were merged using the aforementioned methods. Finally, 70.53% of the total

predicted genes were successfully assigned with at least one functional annotation (Table 6). DATA RECORDS All sequencing data have been uploaded to the NCBI SRA database. The Illumina

sequencing data for genomic survey has been deposited in the NCBI Sequence Read Archive with accession number SRR2788058946 under BioProject accession number PRJNA1070324. The genomic PacBio

sequencing data has been deposited in the NCBI Sequence Read Archive with accession number SRR2775680047, SRR2775680148, SRR2786204449 and SRR2786204550 under BioProject accession number

PRJNA1070324. The Hi-C sequencing data has been deposited in the NCBI Sequence Read Archive with accession number SRR2788053551, SRR2788053652, SRR2788053753, SRR2788053854, SRR2788053955

and SRR2788054056 under BioProject accession number PRJNA1073006. The final chromosome-level assembled genome file has been uploaded to the GenBank database under the accession

JAZBEV00000000057. TECHNICAL VALIDATION To evaluate the integrity and accuracy of the genome assembly, the completeness of the final genome assembly was assessed using BUSCO (v5.2.2) and the

arthropoda_odb10 database25. It was shown that 92.89% of the 1013 single-copy genes were highly conserved orthologs (88.75% complete, 4.15% fragmented, and 7.11% missing). By aligning the

Illumina sequencing reads (PRJNA471201)3 to the genome using BWA (v0.7.15)24, the read-mapping rate was 99.00%. This indicates a high mapping efficiency. Thus, the above results indicated

that we obtained a high-quality genome of the _E. carinicauda_. CODE AVAILABILITY No specific code was used in this study. The data analyses used standard bioinformatic tools specified in

the methods. REFERENCES * _World Register of Marine Species_ https://www.marinespecies.org (2024). * Zhang, Q., Zhang, C., Yu, Y. & Li, F. Analysis of genetic diversity and population

structure of the ridgetail white prawn _Exopalaemon carinicauda_ in China. _Aquacult Rep._ 27, 101369 (2022). Google Scholar * Li, J. _et al_. Genome survey and high-resolution backcross

genetic linkage map construction of the ridgetail white prawn _Exopalaemon carinicauda_ applications to QTL mapping of growth traits. _Bmc Genomics._ 20, 598 (2019). Article PubMed PubMed

Central Google Scholar * Ge, Q., Li, Z., Li, J., Wang, J. & Li, J. Effects of acute salinity stress on the survival and prophenoloxidase system of _Exopalaemon carinicauda_. _Acta

Oceanol Sin._ 39, 57–64 (2020). Article CAS Google Scholar * Wang, X., Yan, B., Ma, S. & Dong, S. Study on The Biology and Cultural Ecology of _Exopalaemon carinicauda_. _Shandong

Fisheries._ 22, 21–24 (2005). Google Scholar * Huan, G. _et al_. Analysis to the Activities of Five Factors in Response to Temperature in _Exopalaemon carinicauda_. _Journal of Huaihai

Institute of Technology._ 23, 72–75 (2014). Google Scholar * Zhang, Z. _et al_. Effects of adding EM bacteria and mechanical aeration on water quality, growth and antioxidant status of

_Meretrix meretrix_ and _Exopalaemon carinicauda_ farmed in the clam–shrimp polyculture system. _Aquac Res._ 53, 1823–1832 (2022). Article CAS Google Scholar * Gui, T. _et al_.

CRISPR/Cas9-Mediated Genome Editing and Mutagenesis of EcChi4 in _Exopalaemon carinicauda_. _G3 Genes Genom Genet._ 6, 3757–3764 (2016). Article CAS Google Scholar * Miao, M. _et al_.

CRISPR/Cas9-mediated gene mutation of EcIAG leads to sex reversal in the male ridgetail white prawn _Exopalaemon carinicauda_. _Front Endocrinol._ 14, 1266641 (2023). Article Google Scholar

* Gao, Y. _et al_. CRISPR/Cas9-mediated mutation on an insulin-like peptide encoding gene affects the growth of the ridgetail white prawn _Exopalaemon carinicauda_. _Front Endocrinol._ 13,

986491 (2022). Article Google Scholar * Yuan, J. _et al_. Genome Sequences of Marine Shrimp _Exopalaemon carinicauda_ Holthuis Provide Insights into Genome Size Evolution of Caridea. _Mar

Drugs._ 15, 213–230 (2017). Article PubMed PubMed Central Google Scholar * Uengwetwanit, T. _et al_. A chromosome-level assembly of the black tiger shrimp (_Penaeus monodon_) genome

facilitates the identification of growth-associated genes. _Mol Ecol Resour._ 21, 1620–1640 (2021). Article CAS PubMed PubMed Central Google Scholar * Wang, Q. _et al_. Improved genome

assembly of Chinese shrimp (_Fenneropenaeus chinensis_) suggests adaptation to the environment during evolution and domestication. _Mol Ecol Resour._ 22, 334–344 (2022). Article CAS PubMed

Google Scholar * Zhang, X. _et al_. Penaeid shrimp genome provides insights into benthic adaptation and frequent molting. _Nat Commun._ 10, 356 (2019). Article ADS CAS PubMed PubMed

Central Google Scholar * Li, Y., Liu, P., Li, J., Li, J. & Gao, B. The chromosome preparation and karyotype in the ridgetail white prawn _Exopalaemon carinicauda_. _Journal of Dalian

Ocean University._ 27, 453–456 (2012). Google Scholar * Jiang, Q., Xie, S., Zhou, Q. & Lan, W. Chromosome Karyotype in Freshwater Prown _Exopalaemon modestus_. _Fisheries Science._ 27,

470–472 (2008). Google Scholar * Liu, B., _et al_. _Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects_ (2013). * Marçais, G. & Kingsford, C.

A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. _Bioinformatics._ 27, 764–770 (2011). Article PubMed PubMed Central Google Scholar * Vurture, G. W.

_et al_. GenomeScope: fast reference-free genome profiling from short reads. _Bioinformatics._ 33, 2202–2204 (2017). Article CAS PubMed PubMed Central Google Scholar * Durand, N. C. _et

al_. Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. _Cell Syst._ 3, 95–98 (2016). Article CAS PubMed PubMed Central Google Scholar * Dudchenko, O.

_et al_. De novo assembly of the _Aedes aegypti_ genome using Hi-C yields chromosome-length scaffolds. _Science._ 356, 92–95 (2017). Article ADS CAS PubMed PubMed Central Google Scholar

* Durand, N. C. _et al_. Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom. _Cell Syst._ 3, 99–101 (2016). Article CAS PubMed PubMed Central Google

Scholar * Wolff, J. _et al_. Galaxy HiCExplorer 3: a web server for reproducible Hi-C, capture Hi-C and single-cell Hi-C data analysis, quality control and visualization. _Nucleic Acids

Res._ 48, W177–w184 (2020). Article CAS PubMed PubMed Central Google Scholar * Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform.

_Bioinformatics._ 25, 1754–1760 (2009). Article CAS PubMed PubMed Central Google Scholar * Seppey, M., Manni, M. & Zdobnov, E. M. BUSCO: Assessing Genome Assembly and Annotation

Completeness. _Methods Mol Biol._ 1962, 227–245 (2019). Article CAS PubMed Google Scholar * Han, Y. & Wessler, S. R. MITE-Hunter: a program for discovering miniature inverted-repeat

transposable elements from genomic sequences. _Nucleic Acids Res._ 38, e199 (2010). Article PubMed PubMed Central Google Scholar * Ellinghaus, D., Kurtz, S. & Willhoeft, U.

LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. _Bmc Bioinformatics._ 9, 18 (2008). Article PubMed PubMed Central Google Scholar * Xu, Z.

& Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. _Nucleic Acids Res._ 35, W265–268 (2007). Article PubMed PubMed Central Google Scholar

* Ou, S. & Jiang, N. LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons. _Plant Physiol._ 176, 1410–1422 (2018). Article

CAS PubMed Google Scholar * Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. _Curr Protoc Bioinformatics._ CHAPTER 4, Unit 4.10 (2004). PubMed Google

Scholar * Flynn, J. M. _et al_. RepeatModeler2 for automated genomic discovery of transposable element families. _Proc Natl Acad Sci USA_ 117, 9451–9457 (2020). Article ADS CAS PubMed

PubMed Central Google Scholar * Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. _Nucleic Acids Res._ 25, 955–964

(1997). Article CAS PubMed PubMed Central Google Scholar * Griffiths-Jones, S. _et al_. Rfam: annotating non-coding RNAs in complete genomes. _Nucleic Acids Res._ 33, D121–D124 (2005).

Article CAS PubMed Google Scholar * Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. _Bioinformatics._ 29, 2933–2935 (2013). Article CAS PubMed

PubMed Central Google Scholar * Stanke, M. _et al_. AUGUSTUS: ab initio prediction of alternative transcripts. _Nucleic Acids Res._ 34, W435–439 (2006). Article CAS PubMed PubMed

Central Google Scholar * Korf, I. Gene finding in novel genomes. _Bmc Bioinformatics._ 5, 59 (2004). Article PubMed PubMed Central Google Scholar * Majoros, W. H., Pertea, M. &

Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. _Bioinformatics._ 20, 2878–2879 (2004). Article CAS PubMed Google Scholar * Lomsadze, A.,

Burns, P. D. & Borodovsky, M. Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm. _Nucleic Acids Res._ 42, e119 (2014). Article PubMed

PubMed Central Google Scholar * Keilwagen, J., Hartung, F. & Grau, J. GeMoMa: Homology-Based Gene Prediction Utilizing Intron Position Conservation and RNA-seq Data. _Methods Mol

Biol._ 1962, 161–177 (2019). Article CAS PubMed Google Scholar * Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with

HISAT2 and HISAT-genotype. _Nat Biotechnol._ 37, 907–915 (2019). Article CAS PubMed PubMed Central Google Scholar * Shi, K. _et al_. Full-length transcriptome sequences of ridgetail

white prawn _Exopalaemon carinicauda_ provide insight into gene expression dynamics during thermal stress. _Sci Total Environ._ 747, 141238 (2020). Article ADS CAS PubMed Google Scholar

* Trapnell, C. _et al_. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. _Nat Biotechnol._ 28, 511–515

(2010). Article CAS PubMed PubMed Central Google Scholar * Haas, B. J. _et al_. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. _Nucleic Acids

Res._ 31, 5654–5666 (2003). Article CAS PubMed PubMed Central Google Scholar * Haas, B. J. _et al_. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program

to Assemble Spliced Alignments. _Genome Biol._ 9, R7 (2008). Article PubMed PubMed Central Google Scholar * Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic

local alignment search tool. _J Mol Biol._ 215, 403–410 (1990). Article CAS PubMed Google Scholar * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880589

(2024). * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27756800 (2024). * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27756801 (2024). *

_NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27862044 (2024). * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27862045 (2024). * _NCBI

Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880535 (2024). * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880536 (2024). * _NCBI Sequence

Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880537 (2024). * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880538 (2024). * _NCBI Sequence Read

Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880539 (2024). * _NCBI Sequence Read Archive_ https://identifiers.org/ncbi/insdc.sra:SRR27880540 (2024). * _NCBI GenBank_

https://identifiers.org/ncbi/insdc:JAZBEV000000000 (2024). Download references ACKNOWLEDGEMENTS This research was funded by National Key Research and Development Program of China (No.

2023YFD2401001), National Natural Science Foundation of China (32072974), China Agriculture Research System of MOF and MARA (CARS-48) and the Central Public-interest Scientific Institution

Basal Research Fund, CAFS (2023TD50). AUTHOR INFORMATION Author notes * These authors contributed equally: Jiajia Wang, Jianjian Lv, Miao Shi. AUTHORS AND AFFILIATIONS * State Key Laboratory

of Mariculture Biobreeding and Sustainable Goods, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, Shandong, 266071, China Jiajia Wang, Jianjian Lv,

Qiong Wang, Yuying He, Jian Li & Jitao Li * Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao Marine Science and Technology Center, Qingdao, Shandong,

266237, China Jiajia Wang, Jianjian Lv, Qianqian Ge, Qiong Wang, Yuying He, Jian Li & Jitao Li * Berry Genomics Co., Ltd., Beijing, China Miao Shi Authors * Jiajia Wang View author

publications You can also search for this author inPubMed Google Scholar * Jianjian Lv View author publications You can also search for this author inPubMed Google Scholar * Miao Shi View

author publications You can also search for this author inPubMed Google Scholar * Qianqian Ge View author publications You can also search for this author inPubMed Google Scholar * Qiong

Wang View author publications You can also search for this author inPubMed Google Scholar * Yuying He View author publications You can also search for this author inPubMed Google Scholar *

Jian Li View author publications You can also search for this author inPubMed Google Scholar * Jitao Li View author publications You can also search for this author inPubMed Google Scholar

CONTRIBUTIONS J.W., J.L. and J.L. (Jitao Li) conceived and designed the study. Q.G. and Q.W. prepared the material. J.W., J.L. (Jianjian Lv) and M.S. analyzed the data. J.W. and Y.H.

prepared the results. J.W. drafted the manuscript. J.L. (Jianjian Lv) and J.L. (Jitao Li) edited and improved the manuscript. All authors read and approved the final manuscript.

CORRESPONDING AUTHOR Correspondence to Jitao Li. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests. ADDITIONAL INFORMATION PUBLISHER’S NOTE Springer Nature

remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. RIGHTS AND PERMISSIONS OPEN ACCESS This article is licensed under a Creative Commons

Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original

author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the

article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use

is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Wang, J., Lv, J., Shi, M. _et al._ Chromosome-level genome assembly of ridgetail

white shrimp _Exopalaemon carinicauda_. _Sci Data_ 11, 576 (2024). https://doi.org/10.1038/s41597-024-03423-9 Download citation * Received: 04 March 2024 * Accepted: 24 May 2024 * Published:

04 June 2024 * DOI: https://doi.org/10.1038/s41597-024-03423-9 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a

shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative