Pull Alignment Character Position

Question

485 views Asked by user1357015 At 03 June 2012 at 18:16

I use pairwiseAlignment() to get the following:

> alignment <-pairwiseAlignment(pattern = canonical.protein, subject=protein.extracted)
> alignment
Global PairwiseAlignedFixedSubject (1 of 1)
pattern: [448]          DDWEIPDGQITVGQRIGSGSFGTVYKGKWHGDVAVKMLNVTAPTPQQLQAFKNEVGV...FMVGRGYLSPDLSKVRSNCPKAMKRLMAE  CLKKKRDERPLFPQILASIELLARSLPK 
subject:   [1]     DDWEIPDGQITVGQRIGSGSFGTVYKGKWHGDVAVKMLNVTAPTPQQLQAFKNEVGV...FMVGRGYLSPDLSKVRSNCPKAMKRLMAECLKKKRDERPLFPQILASIELLARSLPK 
score: -912.3752

I can then use:

toString(pattern(alignment))
toString(subject(alignment))

to get the full sequence string for both the pattern and the subject. However, how do I get the numbers 448 and 1 out of the object as integers? I need to use these values, but there doesn't seem to be a way to access them.

There are 2 answers

Niek de Klein On 03 June 2012 at 20:03

Since you can make a string out of the alignment, you can use R's string functions. For example, substr(toString(pattern(alignment)), 448, 448) returns the 448th character. I'm not familiar with that library, so there may be a built-in way that I don't know of. See http://www.statmethods.net/management/functions.html for string functions in R.

**Martin Morgan** · Accepted Answer · 2012-06-03T18:38:53+00:00

I believe these are the starts of the alignments, so

start(pattern(alignment))

Your question would be clearer with a fully reproducible example, e.g.,

library(Biostrings)
example(pairwiseAlignment)
aln <- pairwiseAlignment(AAString("PAWHEAE"), AAString("HEAGAWGHEE"),
    substitutionMatrix = "BLOSUM50", gapOpening = 0, gapExtension = -8)

Then

> aln
Global PairwiseAlignedFixedSubject (1 of 1)
pattern: [1] PA--W-HEAE
subject: [2] EAGAWGHE-E
score: 1
> start(subject(aln))
[1] 2
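
To tie this back to the question, here is a minimal sketch that stores the positions as plain integers, reusing the same aln call as above. The variable names (pattern_start and so on) are made up for illustration, and the use of end() alongside start() is my addition rather than part of the original answer. It assumes a Biostrings version that still provides pairwiseAlignment(); on recent Bioconductor releases this functionality appears to have moved to the pwalign package, so you may need library(pwalign) instead.

```r
library(Biostrings)  # assumption: pairwiseAlignment() is available from this package

# Same alignment as in the example above
aln <- pairwiseAlignment(AAString("PAWHEAE"), AAString("HEAGAWGHEE"),
    substitutionMatrix = "BLOSUM50", gapOpening = 0, gapExtension = -8)

# The bracketed numbers printed next to "pattern:" and "subject:"
pattern_start <- start(pattern(aln))
subject_start <- start(subject(aln))

# The matching end positions of the aligned ranges (illustrative extra)
pattern_end <- end(pattern(aln))
subject_end <- end(subject(aln))

# These are ordinary integer vectors, so they can be used in arithmetic
# or for indexing, e.g. with substr() on the original sequence string
is.integer(pattern_start)
```

The same calls should work on the alignment object from the question, i.e. start(pattern(alignment)) and start(subject(alignment)).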

Also, the Bioconductor mailing list is more appropriate for these questions; no subscription required.
