wget -c -r -np -k -L -A "*metadata*" ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_24/
可以把所有metadata数据下载到当前的文件夹下了。
检查里面的记录数:
cd /ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_24
ls *gz |while read id;do (echo -n $id;echo -n " ";zcat $id |wc -l ) ;done
以下是输出的结果:
gencode.v24.metadata.Annotation_remark.gz 40879
gencode.v24.metadata.EntrezGene.gz 170466
gencode.v24.metadata.Exon_supporting_feature.gz 19193542
gencode.v24.metadata.Gene_source.gz 66206
gencode.v24.metadata.HGNC.gz 182831
gencode.v24.metadata.PDB.gz 94547
gencode.v24.metadata.PolyA_feature.gz 84652
gencode.v24.metadata.Pubmed_id.gz 209094
gencode.v24.metadata.RefSeq.gz 75365
gencode.v24.metadata.Selenocysteine.gz 119
gencode.v24.metadata.SwissProt.gz 45067
gencode.v24.metadata.Transcript_source.gz 217202
gencode.v24.metadata.Transcript_supporting_feature.gz 87375
gencode.v24.metadata.TrEMBL.gz 61924
Screen Shot 2017-09-07 at 11.38.20.png
还可以下载所有的gtf文件:
wget -c -r -np -nd -k -L -A “*gtf.gz” ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_24/
旨在To create comprehensive reference maps of all human cells—the fundamental units of life—as a basis for both understanding human health and diagnosing, monitoring, and treating disease.