csrins

Student. Teacher. Softsmith.

Archive for July, 2006

Distributed Computing: Handouts for Day 12 Posted

Posted by csrins on July 28, 2006

The handouts for the topics covered on Day 12 – Security – are now posted online.

The remainder of the handouts will be posted over the course of this weekend.

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License.

Posted in Distributed Computing, Education, M.Sc. Computer Science | Leave a Comment »

Data Warehousing and Mining Handouts Online

Posted by csrins on July 26, 2006

Head over to csrins.netfirms.com to download the lecture handouts for units 1 to 3.

Lecture for 27 July 2006 begins at 12:30 pm.
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License.

Posted in Data Mining, Data Warehousing and Mining, Education, M.Sc. Computer Science | Leave a Comment »

Data Warehousing and Mining Discussion Group Now Online

Posted by csrins on July 26, 2006

An online discussion group to facilitate exchange of technical queries and solutions has been set up for the students enrolled in this course at R.D.National College.

Bonafide students of the course are advised to join up. This group will be one of the major channels of communication with the course faculty, and within the student community.

How do you join?

If you are not already subscribed, send me an email at the contact address in the following format:

Subject: MSCCS1 [name]: Join Data Warehousing and Mining

Body:

[Your Complete Name]

It is highly desirable, though not mandatory that your email address be something that’s easy enough to identify you; for example firstname.lastname@gmail.com, but if you wish to conduct business using something like piggywiggy@emailprovider.com that’s entirely up to you!

As mature post-graduate students, you are expected to follow proper netiquette in your dealings in the discussion group. Verbal abuse and off-topic discussions will not be tolerated. Be polite, conscientious, and dilligent, and make the most of the tools at your disposal.

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License.

Posted in Data Mining, Data Warehousing and Mining, Education, M.Sc. Computer Science | Leave a Comment »

Syllabus: Data Warehousing and Mining [M.Sc. I] @ Mumbai Uni

Posted by csrins on July 26, 2006

Paper IV

Section-II

Objectives of the course: The data warehousing part of module aims to give students a good overview of the ideas and techniques which are behind recent development in the data warehousing and online analytical processing (OLAP) fields, in terms of data models, query language, conceptual design methodologies, and storage techniques. Data mining part of the model aims to motivate, define and characterize data mining as process; to motivate, define and characterize data mining applications.

Data Warehousing:

  1. Overview And Concepts: Need for data warehousing, Basic elements of data warehousing, Trends in data warehousing.
  2. Planning And Requirements: Project planning and management, Collecting the requirements.
  3. Architecture And Infrastructure: Architectural components, Infrastructure and metadata.
  4. Data Design And Data Representation: Principles of dimensional modeling, Dimensional modeling advanced topics, data extraction, transformation and loading, data quality.
  5. Information Access And Delivery: Matching information to classes of users, OLAP in data warehouse, Data warehousing and the web.
  6. Implementation And Maintenance: Physical design process, data warehouse deployment, growth and maintenance.

Data Mining:

  1. Introduction: Basics of data mining, related concepts, Data mining techniques.
  2. Data Mining Algorithms: Classification, Clustering, Association rules.
  3. Knowledge Discovery : KDD Process
  4. Web Mining: Web Content Mining, Web Structure Mining, Web Usage mining.
  5. Advanced Topics: Spatial mining, Temporal mining.
  6. Visualisation : Data generalization and summarization-based
  7. characterization, Analytical characterization: analysis of attribute relevance, Mining class comparisons: Discriminating between different classes, Mining descriptive statistical measures in large databases

  8. Data Mining Primitives, Languages, and System Architectures: Data mining primitives, Query language, Designing GUI based on a data mining query language, Architectures of data mining systems
  9. Application and Trends in Data Mining: Applications, Systems products and research prototypes, Additional themes in data mining, Trends in data mining

Text Books:

  1. Paulraj Ponniah, “Data Warehousing Fundamentals”, John Wiley.
  2. M.H. Dunham, “Data Mining Introductory and Advanced Topics”, Pearson Education.
  3. Han, Kamber, “Data Mining Concepts and Techniques”, Morgan Kaufmann
  4. Pieter Adriaans, Dolf Zantinge , “Data Mining”,  Pearson Education Asia

References:

  1. Ralph Kimball, “The Data Warehouse Lifecycle toolkit”, John Wiley.
  2. M Berry and G. Linoff, “Mastering Data Mining”, John Wiley.
  3. W.H. Inmon, “Building the Data Warehouses”, Wiley Dreamtech.
  4. R. Kimball, “The Data Warehouse Toolkit”, John Wiley.
  5. E.G. Mallach, “Decision Support and Data Warehouse systems”, TMH.

Practicals

Section II

Software used: Microsoft SQL Server 2000/7.0

1. Create a warehouse in MS SQL Server 2000 and import various databases from external sources such as Access/Excel/Text File by using Data Transformation Services (DTS) tool.

2. Create and schedule a DTS Package using Data Transformation services (DTS) tool. Fire at least 5 queries on the database.

3. Create a Database using Analysis Manager and create a Single-Dimensional OLAP cube by using STAR schema.

4. Create a Database using Analysis Manager and create a Multi-Dimensional OLAP cube by using Snowflake schema.

5. Create a Mining Model by using Relational Data.

6. Create a Mining Model by using OLAP Data.

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License.

Posted in Data Mining, Data Warehousing and Mining, Education, M.Sc. Computer Science | Leave a Comment »

MSc @ RDNC: Lectures Resume 24th July 2006

Posted by csrins on July 23, 2006

DC and AI Lectures will resume from Monday, 24 July 2006.

Be there at 8:30 am!

Posted in Artificial Intelligence, Distributed Computing, Education, M.Sc. Computer Science | Leave a Comment »