このアイテムのアクセス数: 80

このアイテムのファイル:
ファイル 記述 サイズフォーマット 
s12859-022-04801-z.pdf1.97 MBAdobe PDF見る/開く
タイトル: Integer programming for selecting set of informative markers in paternity inference
著者: Nishiyama, Soichiro  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0002-3451-6166 (unconfirmed)
Sato, Kengo
Tao, Ryutaro  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0001-7811-5789 (unconfirmed)
著者名の別形: 西山, 総一郎
田尾, 龍太郎
キーワード: Optimization
Parentage
Population genetics
発行日: 8-Jul-2022
出版者: Springer Nature
BMC
誌名: BMC Bioinformatics
巻: 23
論文番号: 265
抄録: BACKGROUND: Parentage information is fundamental to various life sciences. Recent advances in sequencing technologies have made it possible to accurately infer parentage even in non-model species. The optimization of sets of genome-wide markers is valuable for cost-effective applications but requires extremely large amounts of computation, which presses for the development of new efficient algorithms. RESULTS: Here, for a closed half-sib population, we generalized the process of marker loci selection as a binary integer programming problem. The proposed systematic formulation considered marker localization and the family structure of the potential parental population, resulting in an accurate assignment with a small set of markers. We also proposed an efficient heuristic approach, which effectively improved the number of markers, localization, and tolerance to missing data of the set. Applying this method to the actual genotypes of apple (Malus × domestica) germplasm, we identified a set of 34 SNP markers that distinguished 300 potential parents crossed to a particular cultivar with a greater than 99% accuracy. CONCLUSIONS: We present a novel approach for selecting informative markers based on binary integer programming. Since the data generated by high-throughput sequencing technology far exceeds the requirement for parentage assignment, a combination of the systematic marker selection with targeted SNP genotyping, such as KASP, allows flexibly enlarging the analysis up to a scale that has been unrealistic in various species. The method developed in this study can be directly applied to unsolved large-scale problems in breeding, reproduction, and ecological research, and is expected to lead to novel knowledge in various biological fields. The implementation is available at https://github.com/SoNishiyama/IP-SIMPAT .
著作権等: © The Author(s) 2022.
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
URI: http://hdl.handle.net/2433/284492
DOI(出版社版): 10.1186/s12859-022-04801-z
PubMed ID: 35804290
出現コレクション:学術雑誌掲載論文等

アイテムの詳細レコードを表示する

Export to RefWorks


出力フォーマット 


このアイテムは次のライセンスが設定されています: クリエイティブ・コモンズ・ライセンス Creative Commons