A cooperative framework for molecular biology database integration using image object selection

PhD thesis


Khan, N. 2004. A cooperative framework for molecular biology database integration using image object selection. PhD thesis Middlesex University School of Computing Science
TypePhD thesis
TitleA cooperative framework for molecular biology database integration using image object selection
AuthorsKhan, N.
Abstract

The theme and the concept of 'Molecular Biology Database Integration' and the problems associated with this concept initiated the idea for this Ph.D research. The available technologies facilitate to analyse the data independently and discretely but it fails to integrate the data resources for more meaningful information. This along with the integration issues created the scope for this Ph.D research. The research has reviewed the 'database interoperability' problems and it has suggested a framework for integrating the molecular biology databases. The framework has proposed to develop a cooperative environment to share information on the basis of common purpose for the molecular biology databases. The research has also reviewed other implementation and interoperability issues for laboratory based, dedicated and target specific database. The research has addressed the following issues: diversity of molecular biology databases schemas, schema constructs and schema implementation multi-database query using image object keying, database integration technologies using context graph, automated navigation among these databases. This thesis has introduced a new approach for database implementation. It has introduced an interoperable component database concept to initiate multidatabase query on gene mutation data. A number of data models have been proposed for gene mutation data which is the basis for integrating the target specific component database to be integrated with the federated information system. The proposed data models are: data models for genetic trait analysis, classification of gene mutation data, pathological lesion data and laboratory data. The main feature of this component database is non-overlapping attributes and it will follow non-redundant integration approach as explained in the thesis. This will be achieved by storing attributes which will not have the union or intersection of any attributes that exist in public domain molecular biology databases. Unlike data warehousing technique, this feature is quite unique and novel. The component database will be integrated with other biological data sources for sharing information in a cooperative environment. This involves developing new tools. The thesis explains the role of these new tools which are: meta data extractor, mapping linker, query generator and result interpreter. These tools are used for a transparent integration without creating any global schema of the participating databases. The thesis has also established the concept of image object keying for multidatabase query and it has proposed a relevant algorithm for matching protein spot in gel electrophoresis image. An object spot in gel electrophoresis image will initiate the query when it is selected by the user. It matches the selected spot with other similar spots in other resource databases. This image object keying method is an alternative to conventional multidatabase query which requires writing complex SQL scripts. This method also resolve the semantic conflicts that exist among molecular biology databases. The research has proposed a new framework based on the context of the web data for interactions with different biological data resources. A formal description of the resource context is described in the thesis. The implementation of the context into Resource Document Framework (RDF) will be able to increase the interoperability by providing the description of the resources and the navigation plan for accessing the web based databases. A higher level construct is developed (has, provide and access) to implement the context into RDF for web interactions. The interactions within the resources are achieved by utilising an integration domain to extract the required information with a single instance and without writing any query scripts. The integration domain allows to navigate and to execute the query plan within the resource databases. An extractor module collects elements from different target webs and unify them as a whole object in a single page. The proposed framework is tested to find specific information e.g., information on Alzheimer's disease, from public domain biology resources, such as, Protein Data Bank, Genome Data Bank, Online Mendalian Inheritance in Man and local database. Finally, the thesis proposes further propositions and plans for future work.

Department nameSchool of Computing Science
Institution nameMiddlesex University
Publication dates
Print21 Aug 2014
Publication process dates
Deposited21 Aug 2014
Completed2004
Output statusPublished
LanguageEnglish
File
Permalink -

https://repository.mdx.ac.uk/item/84vy6

Download files

  • 30
    total views
  • 42
    total downloads
  • 1
    views this month
  • 0
    downloads this month

Export as

Related outputs

ADAPT: Approach to Develop context-Aware solutions for Personalised asthma managemenT
Quinde, M., Augusto, J., Khan, N. and Van Wyk, A. 2020. ADAPT: Approach to Develop context-Aware solutions for Personalised asthma managemenT. Journal of Biomedical Informatics. 111, pp. 1-20. https://doi.org/10.1016/j.jbi.2020.103586
Context-aware solutions for asthma condition management: a survey
Quinde, M., Khan, N., Augusto, J., Van Wyk, A. and Stewart, J. 2020. Context-aware solutions for asthma condition management: a survey. Universal Access in the Information Society. 19 (3), pp. 571-593. https://doi.org/10.1007/s10209-018-0641-5
Using formal methods to guide the development of an asthma management system
Augusto, J., Quinde, M. and Khan, N. 2019. Using formal methods to guide the development of an asthma management system. 10th International Conference Dependable Systems, Services and Technologies. Leeds, United Kingdom 05 - 07 Jun 2019 IEEE. pp. 57-62 https://doi.org/10.1109/DESSERT.2019.8770017
Case-based reasoning for context-aware solutions supporting personalised asthma management
Quinde, M., Khan, N. and Augusto, J. 2019. Case-based reasoning for context-aware solutions supporting personalised asthma management. 18th International Conference on Artificial Intelligence and Soft Computing. Zakopane, Poland 16 - 20 Jun 2019 Cham Springer. https://doi.org/10.1007/978-3-030-20915-5_24
A Human-in-The-Loop context-aware system allowing the application of case-based reasoning for asthma management
Quinde, M., Khan, N., Augusto, J. and Van Wyk, A. 2019. A Human-in-The-Loop context-aware system allowing the application of case-based reasoning for asthma management. Duffy, V. (ed.) 21st International Conference on Human-Computer Interaction (HCI International 2019). Orlando, USA 26 - 31 Jul 2019 Springer. pp. 125-140 https://doi.org/10.1007/978-3-030-22219-2_10
Context aware virtual assistant with case-based conflict resolution in multi-user smart home environment
Ospan, B., Khan, N., Augusto, J., Quinde, M. and Nurgaliyev, K. 2018. Context aware virtual assistant with case-based conflict resolution in multi-user smart home environment. 2018 International Conference on Computing and Network Communications (CoCoNet). Astana, Kazakhstan 15 - 17 Aug 2018 IEEE Computer Society. pp. 36-44 https://doi.org/10.1109/CoCoNet.2018.8476898
Fake news and critical thinking in information evaluation
Georgiadou, E., Rahanu, H., Siakas, K., McGuinness, C., Edwards, J., Hill, V., Khan, N., Kirby, P., Cavanagh, J. and Knezevic, R. 2018. Fake news and critical thinking in information evaluation. Western Balkan Information Literacy Conference WBILC 2018. Bihac, Bosnia and Herzegovina 21 - 22 Jun 2018 pp. 50-71
Personalisation of context-aware solutions supporting asthma management
Quinde, M., Khan, N. and Augusto, J. 2018. Personalisation of context-aware solutions supporting asthma management. 16th International Conference on Computers Helping People with Special Needs. University of Linz, Austria 11 - 13 Jul 2018 Springer. pp. 510-519 https://doi.org/10.1007/978-3-319-94274-2_75
An improved model for GUI design of mHealth context-aware applications
Quinde, M. and Khan, N. 2018. An improved model for GUI design of mHealth context-aware applications. 20th International Conference on Human Computer Interaction (HCII2018). Las Vegas, USA 15 Jul 2018 Springer. pp. 313-326 https://doi.org/10.1007/978-3-319-91803-7_23
Improved multi-user interaction in a smart environment through a preference-based conflict resolution virtual assistant
Nurgaliyev, K., Di Mauro, D., Khan, N. and Augusto, J. 2017. Improved multi-user interaction in a smart environment through a preference-based conflict resolution virtual assistant. 13th International Conference on Intelligent Environments (IE’17). Seoul, South Korea 23 - 25 Aug 2017 Institute of Electrical and Electronics Engineers (IEEE). pp. 100-107 https://doi.org/10.1109/IE.2017.21
Data Warehouse and BI to catalize information use in health sector for decision making: a case study
Slum Ally, S. and Khan, N. 2016. Data Warehouse and BI to catalize information use in health sector for decision making: a case study. The 2016 International Conference on Computational Science and Computational Intelligence. Las Vegas, USA 15 - 17 Dec 2016 Institute of Electrical and Electronics Engineers (IEEE). pp. 92-97 https://doi.org/10.1109/CSCI.2016.0025
Is Context-aware Reasoning = Case-based Reasoning?
Khan, N., Alegre, U., Kramer, D. and Augusto, J. 2017. Is Context-aware Reasoning = Case-based Reasoning? Tenth Interdisciplinary Conference on Modelling and Using Context. Paris, France 20 - 23 Jun 2017 Springer. pp. 418-431 https://doi.org/10.1007/978-3-319-57837-8_35
Big data to optimise product strategy in electronic industry
Khan, N., Lakshmi Sabih, V., Georgiadou, E. and Repanovich, A. 2016. Big data to optimise product strategy in electronic industry. 3rd International Conference on Advances in Big Data Analytics. Las Vegas, USA 25 - 28 Jul 2016 CSREA Press.
The development of student learning and information literacy: a case study
Rahanu, H., Georgiadou, E., Khan, N., Colson, R., Hill, V. and Edwards, J. 2016. The development of student learning and information literacy: a case study. Education for Information. 32 (3), pp. 211-224. https://doi.org/10.3233/EFI-150959
The role information literacy in overcoming obstacles to learning and lifelong learning
Rahanu, H., Khan, N., Georgiadou, E. and Siakas, K. 2015. The role information literacy in overcoming obstacles to learning and lifelong learning. 7th International Conference on Education and New Learning Technologies. Barcelona, Spain 06 - 08 Jul 2015 IATED. pp. 1184-1194
Accelerated literacy and information literacy can be achieved through access to new technologies
Rahanu, H., Georgiadou, E., Ross, M. and Khan, N. 2015. Accelerated literacy and information literacy can be achieved through access to new technologies. The BCS Quality Specialist Group's 20th INSPIRE: International Conference for Process Improvement, Research and Education. Loughborough, United Kingdom 30 - 31 Mar 2015 Southampton Solent University. pp. 105-117
Dyslexia adaptive e-learning system based on multi-layer architecture
Alsobhi, A., Khan, N. and Rahanu, H. 2015. Dyslexia adaptive e-learning system based on multi-layer architecture. 2015 Science and Information Conference (SAI). London, United Kingdom 28 - 30 Jul 2015 IEEE. pp. 776-780
The development of student learning and information literacy: a case study [conference item]
Rahanu, H., Georgiadou, E., Khan, N., Colson, R., Hill, V. and Edwards, J. 2015. The development of student learning and information literacy: a case study [conference item]. 12th International Scientific Conference "Western Balkan Information Literacy". Bihac, Bosnia and Herzegovina 18 - 20 Jun 2015 pp. 25-37
Bridging the digital divide: towards shortening the road from illiteracy to information literacy
Georgiadou, E., Rahanu, H., Khan, N., Colson, R. and Sule, C. 2014. Bridging the digital divide: towards shortening the road from illiteracy to information literacy. 11th International Scientific Conference "Western Balkan Information Literacy". Bihac, Bosnia Hertzegovina 11 - 14 Jun 2014 pp. 65-76
Performance evaluation of Levenberg-Marquardt technique in error reduction for diabetes condition classification
Khan, N., Dhara, G. and Kandl, T. 2013. Performance evaluation of Levenberg-Marquardt technique in error reduction for diabetes condition classification. Procedia Computer Science. 18, pp. 2629-2637.
Information integration of drug discovery and clinical studies to support complex queries using an information supply chain framework
Kandl, T. and Khan, N. 2014. Information integration of drug discovery and clinical studies to support complex queries using an information supply chain framework. Journal of Software. 9 (5), pp. 1348-1356. https://doi.org/10.4304/jsw.9.5.1348-1356
Personalised learning materials based on dyslexia types: ontological approach
Alsobhi, A., Khan, N. and Rahanu, H. 2015. Personalised learning materials based on dyslexia types: ontological approach. 19th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2015). Singapore Elsevier. pp. 113-121
DAEL framework: a new adaptive e-learning framework for students with dyslexia
Alsobhi, A., Khan, N. and Rahanu, H. 2015. DAEL framework: a new adaptive e-learning framework for students with dyslexia. International Conference On Computational Science, ICCS 2015 — Computational Science at the Gates of Nature. Reykjavík, Iceland 01 - 03 Jun 2015 pp. 1947-1956
Toward linking dyslexia types and symptoms to the available assistive technologies
Alsobhi, A., Khan, N. and Rahanu, H. 2014. Toward linking dyslexia types and symptoms to the available assistive technologies. The 14th IEEE International Conference on Advanced Learning Technologies - ICALT2014. Athens, Greece 07 - 10 Jul 2014 Institute of Electrical and Electronics Engineers (IEEE). pp. 597-598 https://doi.org/10.1109/ICALT.2014.174
True global optimality of the pressure vessel design problem: a benchmark for bio-inspired optimisation algorithms
Yang, X., Huyck, C., Karamanoglu, M. and Khan, N. 2013. True global optimality of the pressure vessel design problem: a benchmark for bio-inspired optimisation algorithms. International Journal of Bio-Inspired Computation. 5 (6), pp. 329-335. https://doi.org/10.1504/IJBIC.2013.058910
Trusted information supply chain framework in clinical studies
Kandl, T. and Khan, N. 2014. Trusted information supply chain framework in clinical studies. Advanced Science Letters. 20 (1), pp. 103-110. https://doi.org/10.1166/asl.2014.5310
Interactive visualization for low literacy users: from lessons learnt to design
Kodagoda, N., Wong, B., Rooney, C. and Khan, N. 2012. Interactive visualization for low literacy users: from lessons learnt to design. CHI '12 Proceedings of the 2012 ACM annual conference on Human Factors in Computing Systems. New York, NY, USA Association for Computing Machinery (ACM). pp. 1159-1168 https://doi.org/10.1145/2207676.2208565
A software based solution to facilitate end to end information supply chain visibility
Khan, N., Silva, S. and Kandl, T. 2012. A software based solution to facilitate end to end information supply chain visibility. Chen, J. and Su, Q. (ed.) 9th IEEE International Conference on Service System and Service Management (ICSSSM, 2012). Shanghai, China IEEE. pp. 850-855 https://doi.org/10.1109/ICSSSM.2012.6252359
Information seeking behaviour model as a theoretical lens: high and low literate users behaviour process analysed.
Kodagoda, N., Wong, B. and Khan, N. 2010. Information seeking behaviour model as a theoretical lens: high and low literate users behaviour process analysed. in: ECCE '10: Proceedings of the 28th Annual European Conference on Cognitive Ergonomics. Association for Computing Machinery (ACM). pp. 117-124
Open-­card sort to explain why low-literate users abandon their web searches early.
Kodagoda, N., Wong, B. and Khan, N. 2010. Open-­card sort to explain why low-literate users abandon their web searches early. in: BCS'10: Proceedings of the 24th BCS Interaction Specialist Group Conference. British Computer Society. pp. 433-442
Information integration of diverse laboratory data sources using information supply chains
Kandl, T. and Khan, N. 2011. Information integration of diverse laboratory data sources using information supply chains. in: Macedo, M. (ed.) Proceedings of the IADIS international conference e-health 2011 IADIS. pp. 69-80
A cooperative framework for molecular biology database integration using image object selection.
Khan, N. 2004. A cooperative framework for molecular biology database integration using image object selection. PhD thesis Middlesex University School of Computing Science
From local laboratory data to public domain database in search of indirect association of diseases: AJAX based gene data search engine.
Khan, N., Long, H., Rahman, S. and Stockman, A. 2007. From local laboratory data to public domain database in search of indirect association of diseases: AJAX based gene data search engine. in: Twentieth IEEE International Symposium on Computer-Based Medical Systems (CBMS'07). IEEE Computer Society Press. pp. 213-218
Knowledge extraction from microarray datasets using combined multiple models to predict leukemia types.
Stiglic, G., Khan, N. and Kokol, P. 2008. Knowledge extraction from microarray datasets using combined multiple models to predict leukemia types. in: Data mining: foundations and practice. Springer. pp. 339-352
Behaviour characteristics: low and high literacy users information seeking on social service websites.
Kodagoda, N., Wong, B. and Khan, N. 2009. Behaviour characteristics: low and high literacy users information seeking on social service websites. in: Proceedings of the 10th International Conference NZ Chapter of the ACM's Special Interest Group on Human-Computer Interaction. New York Association for Computing Machinery (ACM). pp. 13-16
Overview of behaviour characteristics of high and low literacy users: information seeking of an online social service system
Kodagoda, N., Wong, B. and Khan, N. 2009. Overview of behaviour characteristics of high and low literacy users: information seeking of an online social service system. in: Gross, T., Gulliksen, J., Kotzé, P., Oestreicher, L., Palanque, P., Prates, R. and Winckler, M. (ed.) Human-Computer Interaction – INTERACT 2009: 12th IFIP TC 13 International Conference, Uppsala, Sweden, August 24-28, 2009, Proceedigns Part I Springer.
A cooperative environment for genetic variance analysis using component database for database integration.
Khan, N., Stockman, A. and Rahman, S. 2002. A cooperative environment for genetic variance analysis using component database for database integration. in: Proceedings of the 15th IEEE international conference on computer based medical systems (CBMS). Computer Society Press.
Participatory pattern in asynchronous discussion forum: a cross-cultural perspective.
Khan, N. and Abeysinghe, G. 2009. Participatory pattern in asynchronous discussion forum: a cross-cultural perspective. 10th Annual Conference of the Subject Centre for Information and Computer Sciences. Canterbury, UK 25 - 27 Aug 2009 pp. 102-106
Gestational trophoblastic diseases: 2. Hyperglycosylated hCG as a reliable marker of active neoplasia
Cole, L., Butler, S., Khan, N., Giddings, A., Muller, C., Seckl, M. and Kohorn, E. 2006. Gestational trophoblastic diseases: 2. Hyperglycosylated hCG as a reliable marker of active neoplasia. Gynecologic Oncology. 102 (2), pp. 151-159. https://doi.org/10.1016/j.ygyno.2005.12.045
Interoperability and navigation between medical databases using context graph.
Rahman, S. and Khan, N. 2004. Interoperability and navigation between medical databases using context graph. in: Fourth international conference on intelligent systems design and applications (ISDA 2004). Budapest IEEE Computer Society.
Object modelling of gene mutation data for variance analysis.
Rahman, S., Khan, N. and International Institute of Informatics and Systemics. 2002. Object modelling of gene mutation data for variance analysis. in: Callaos, N. (ed.) Proceedings: the 6th world multiconference on systemics, cybernetics and informatics.[SCI 2002] Orlando, Florida. International Institute of Informatics and Systemics.. pp. 301-305
A conceptual object modelling of gene mutation data.
Rahman, S. and Khan, N. 2001. A conceptual object modelling of gene mutation data. in: Wingender, E. (ed.) Computer science and biology: proceedings of the German conference on bioinformatics. Braunschweig German Research Center for Biotechnology.. pp. 187-190
Integrating molecular biology databases using image object keying.
Rahman, S., Khan, N. and Stockman, A. 2003. Integrating molecular biology databases using image object keying. in: Krol, M., Mitra, S. and Lee, D. (ed.) 16th IEEE symposium on computer-based medical systems. Los Alamitos, Calif. IEEE Computer Society.
Identifying information seeking behaviours of low andhigh literacy users: combined cognitive task analysis.
Kodagoda, N., Wong, B. and Khan, N. 2009. Identifying information seeking behaviours of low andhigh literacy users: combined cognitive task analysis. British Computer Society. pp. 347-354
An approach to develop human gene disorder database for intelligent variance analysis of genes and its products.
Rahman, S., Khan, N. and Clarkson, T. 2001. An approach to develop human gene disorder database for intelligent variance analysis of genes and its products. in: 12th International workshop on Database and Expert Systems, Munich, Germany. Proceedings. Washington DC, USA IEEE Computer Society Press. pp. 301-305
Integration of biological data resources using image object keying.
Rahman, S., Stockman, A. and Khan, N. 2003. Integration of biological data resources using image object keying. in: 16th IEEE Symposium on Computer-based Medical Systems (CBMS'03), New York. Proceedings. . Washington DC, USA IEEE Computer Society Press.
A new approach to detect similar proteins from 2D Gel Electrophoresis Images.
Rahman, S. and Khan, N. 2003. A new approach to detect similar proteins from 2D Gel Electrophoresis Images. in: 3rd International Symposium on Bioinformatics and Bioengineering, Washington DC. Proceedings. Washington DC, USA IEEE Computer Society Press.
A cooperative environment for genetic variance analysis using component database for database integration.
Khan, N., Stockman, A. and Rahman, S. 2002. A cooperative environment for genetic variance analysis using component database for database integration. in: 15th IEEE Symposium on Computer-Based Medical Systems (CBMS 2002), Slovenia. IEEE Computer Society Press. pp. 365-368
A framework for molecular biology databases integration using context graph keying.
Khan, N., Stockman, A. and Rahman, S. 2004. A framework for molecular biology databases integration using context graph keying. in: 17th IEEE COmputer Based Medical System Conference (CBMS 2004), Bethesda, Maryland, USA. Proceedings. IEEE Computer Society Press. pp. 21-26
Prediction of Type II MODY3 diabetes using backpercolation.
Khan, N., Chukwuemeka, I. and Rahman, S. 2005. Prediction of Type II MODY3 diabetes using backpercolation. in: 18th IEEE Cmputer Based Medical System Conference, Dublin. Proceedings London IEEE Computer Society Press. pp. 401-403
Gene expression analysis of leukemia samples using visual interpretation of small ensembles: a case study.
Khan, N., Stiglic, G., Verlic, M. and Kokol, P. 2007. Gene expression analysis of leukemia samples using visual interpretation of small ensembles: a case study. in: 2nd International Conference in Pattern Recognition in Bioinformatics, Singapore. Lecture Notes in Computer Science Heidelberg Springer Berlin.