logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Arun SainiPolitical science
(5/5)

533 Answers

Hire Me
expert
Jane SimaaLaw
(5/5)

619 Answers

Hire Me
expert
Sean WooddEconomics
(4/5)

579 Answers

Hire Me
expert
Sarita VashisthEconomics
(5/5)

583 Answers

Hire Me
R Programming
(5/5)

use a local causal structure learning algorithm to find genes in the Markov blanket of ABCA9 from data. Explain how the algorithm works

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Advanced Data Analytics 2 – Bioinformatics Project

BRCA-50 is a Breast cancer dataset, including the expression levels of 50 important genes in Breast cancer. The dataset includes 1212 samples with 112 samples are of normal cases (class = N) and 1100 samples are of cancer patients (class = C).

1. Use a causal structure learning algorithm to find the gene regulatory network, i.e. the network showing the interactions between genes, using the gene expression data. Explain how the algorithm works. (5) Hints: Please exclude the class variable in building the network

2. EBF1 is an important gene that is involved in many biological processes leading to cancer. Find the top 10 other genes that have strong causal effects on EBF1 using a causal inference algorithm. (5) Hints:

• Exclude the class variable in building the network

• If there are multiple possible causal effects between the cause and the effect, we can use the minimum of the absolute values (of the causal effects) as the final result

• The causal effects are normally ranked based on their absolute values.

3. Use a local causal structure learning algorithm to find genes in the Markov blanket of ABCA9 from data. Explain how the algorithm works. (5)

Discretise the dataset to binary using the average expression of ALL genes as the threshold. The discretised dataset will be used in the following questions.

4. Use PC-simple algorithm (pcSelect) to find the parent and children set of the class variable. Explain how PC-simple works. Evaluate the accuracy of the Naïve Bayes classification on the dataset in the following cases:

a) Use all features (genes) in the dataset

b) Use only the features (genes) in the parent and children set of the class variable

Compare the accuracy of the models in the two cases using 10-fold cross validation. (10)

(5/5)
Attachments:

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme