Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to deal with FAMIS data? #156

Open
vegetablebird123 opened this issue Aug 19, 2021 · 13 comments
Open

How to deal with FAMIS data? #156

vegetablebird123 opened this issue Aug 19, 2021 · 13 comments

Comments

@vegetablebird123
Copy link

Hello everyone ,

I am confused that how to edit the experimental design for FAMIS data?
Now I have raw data with 2 compensation voltage(-45 and -70 v). My experimental design shows below
C2D932E9-68A0-43D2-87E3-648941F70CA7
But the output.csv is not what I expect. For example,It shows the intensity of sample001, -45 and -70v ,respectively. What I expected is every sample gives me 1 intensity, not distinguished by voltage.

So is there anything wrong with my experimental design and how can I fix it? Thanks.

@jpfeuffer
Copy link
Collaborator

jpfeuffer commented Aug 24, 2021

Hi!

That is a good question. Your approach actually could work. But you should probably declare the different voltages as different fractions.

1 1 s1v1.mzML 1
1 2 s1v2.mzML 1
2 1 s2v1.mzML 1
2 2 s2v2.mzML 1
...

(if you post your design as a Markdown table, I can edit it for you)

@vegetablebird123
Copy link
Author

Thank you for your reply. I used your approach shows below
9F53EA38-EB4B-4B46-82FD-65C045AF8168

my command shows here

/home/host/software/nextflow run nf-core/proteomicslfq
--input '/path/to/rawdata/.raw'
--database '/path/to/fasta/
.fasta'
-profile docker
--max_memory 600.GB
--max_cpus 120
--expdesign 'expdesign.tsv'
--add_decoys

But it also reminds me an error
874A970A-466C-4651-9693-3930A76B9B93

my command err shows here
8DB82248-84BA-4B21-A335-3DDAF4457D3D
Do you know how to fix the error?
Thank you so much.

@jpfeuffer
Copy link
Collaborator

Hello,

Do you really want to compare across FAIMS voltages?

If not, try to make the conditions 001,008,010. And in your case, Bio replicate should always be 1. (Unless the voltages used different cell material of course).

@vegetablebird123
Copy link
Author

It works finally. Thanks! But you misunderstood me. I do not want to compare intensity between voltages. Actually,I want it gives me one intensity per sample. Can you help me?

@jpfeuffer
Copy link
Collaborator

Great! Actually, I assumed that already.

If not, try to ..

So, have a look. You should have already intensities per sample.

@vegetablebird123
Copy link
Author

Here is my out.csv. It also gives me the intensities separated by different voltage per sample. And I find that there are duplicated protein names between one sample with different voltages, for example, 4000 proteins are duplicated in sample1-45v and sample 1-70v. And their intensities are different in 2 references. So which reference should I accept for the intensity of duplicated protein?thank s.
Uploading 9F3B7EC4-A023-4A2B-986A-D30BFF53771F.png…

@vegetablebird123
Copy link
Author

5EA0D816-8EB5-4160-A8A0-55DCE3AF08C4

@jpfeuffer
Copy link
Collaborator

Hmm yes, this does not look correct. Can you show your current experimental design?

@vegetablebird123
Copy link
Author

D170392F-1EB5-4FD5-B1FD-685EC4BF41BE

Here!

@jpfeuffer
Copy link
Collaborator

Can you try with consecutive numbers starting from 1 in each column?
That means fractions 1,2,1,2,..
And Conditions 1,1,2,2,..

@vegetablebird123
Copy link
Author

Still the same results.

AC081726-F80D-4129-BB0C-5BE26ABBD102
9E34FCAB-33E1-4463-B7DF-A714F41871EE

@jpfeuffer
Copy link
Collaborator

I disagree. Now it seems like you have the correct annotation of conditions.
What were you expecting?

Do you run the MSstats step? If so, have a look in the msstats/msstats_output.csv. There the intensities should be aggregated further already.

@jpfeuffer
Copy link
Collaborator

Regarding your "duplication problem": Duplicated proteins are not the problem. This will be common. The only thing that needs to be resolved are peptides with the same sequence, modification and same charge state that might appear at different voltages/fractions.
A common thing to do for fractions is to take the highest intensity feature.

But note that I would let MSstats do the work and just look at the MSstats results as mentioned in my previous comment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants