make specific ftp download paths for a dataframe with ftp_paths and assembly accessions
Source:R/genome_download_tools.R
make_download_urls.Rd
make specific ftp download paths for a dataframe with ftp_paths and assembly accessions
Arguments
- data
a dataframe with the columns 'ftp_path' and 'asm_acc' ftp_path should be a column produced by the function make_ftp_paths()
- type
type of download path to generate, one of: 'fna', 'gbff', 'gff', 'gtf', 'faa', 'cds'
Examples
make_download_urls(klebsiella_example_dat, type='fna')
#> # A tibble: 200 × 70
#> `#label` FDA_lab_id HHS_region IFSAC_category LibraryLayout
#> <chr> <chr> <chr> <chr> <chr>
#> 1 PDT001211141.1|SAMN240097… NULL NULL NULL NULL
#> 2 PDT000434574.1|SAMN087838… NULL NULL NULL PAIRED
#> 3 PDT001009222.1|SAMN185110… NULL NULL NULL NULL
#> 4 PDT000414882.1|SAMN102412… NULL NULL NULL NULL
#> 5 PDT000431384.1|SAMN075255… NULL NULL NULL PAIRED
#> 6 PDT001020147.1|SAMN190166… NULL NULL NULL PAIRED
#> 7 PDT000875073.1|SAMN064386… NULL NULL NULL PAIRED
#> 8 PDT000172180.4|SAMN059560… NULL NULL NULL NULL
#> 9 PDT000852554.1|SAMN149675… NULL NULL NULL NULL
#> 10 PDT001025462.1|SAMN189771… NULL NULL NULL PAIRED
#> # ℹ 190 more rows
#> # ℹ 65 more variables: PFGE_PrimaryEnzyme_pattern <chr>,
#> # PFGE_SecondaryEnzyme_pattern <chr>, Platform <chr>, Run <chr>,
#> # asm_acc <chr>, asm_level <chr>, asm_stats_contig_n50 <dbl>,
#> # asm_stats_length_bp <dbl>, asm_stats_n_contig <dbl>, assembly_method <chr>,
#> # attribute_package <chr>, bioproject_acc <chr>, bioproject_center <chr>,
#> # biosample_acc <chr>, isolate_identifiers <chr>, collected_by <chr>, …