Data mining is the practice of examining large databases to find hidden patterns and relationships and to infer rules from them that predict future behavior. The types of information that can be obtained from data mining include associations, sequences, classifications, clusters, and forecasts.
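As a rough illustration of the first category, associations, the sketch below counts how often pairs of items appear together in a handful of purchase records. The `transactions` data and the function names `pair_support` and `confidence` are hypothetical and chosen only for this example; production association mining would normally use a dedicated algorithm rather than this brute-force count.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchase records; each set is one customer's basket.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk", "eggs"},
]

def pair_support(baskets):
    """Return the fraction of baskets that contain each pair of items."""
    counts = Counter()
    for basket in baskets:
        # Sort so that each pair is always stored in the same order.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n / len(baskets) for pair, n in counts.items()}

def confidence(baskets, antecedent, consequent):
    """Estimate how often `consequent` is bought when `antecedent` is bought."""
    with_antecedent = [b for b in baskets if antecedent in b]
    if not with_antecedent:
        return 0.0
    return sum(consequent in b for b in with_antecedent) / len(with_antecedent)

support = pair_support(transactions)
```

Here `support[("bread", "milk")]` is the share of baskets containing both items, and `confidence(transactions, "bread", "milk")` estimates how likely a bread buyer is to also buy milk, which is the kind of inferred rule the definition above refers to.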
Explain how text mining and Web mining differ from conventional data mining.
Conventional data mining focuses on data that has been structured in databases and files. Text mining focuses on finding patterns and trends in unstructured data contained in text files. Web mining looks for patterns in Web data in a few different ways. One of them, Web content mining, extracts knowledge from the content of Web pages.
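To make the contrast with structured data concrete, here is a minimal text-mining sketch that looks for recurring terms in free-text comments. The `comments` list, the tiny `STOP_WORDS` set, and the function name `term_frequencies` are assumptions made for illustration; real text mining typically adds steps such as stemming and phrase detection.

```python
import re
from collections import Counter

# Hypothetical free-text customer comments (unstructured data).
comments = [
    "The delivery was late and the box was damaged.",
    "Late delivery again, very disappointed.",
    "Great product, fast delivery!",
]

# A tiny, illustrative stop-word list; real lists are much longer.
STOP_WORDS = {"the", "was", "and", "very", "again"}

def term_frequencies(documents):
    """Count how often each non-stop word appears across all documents."""
    counts = Counter()
    for doc in documents:
        # Lowercase the text and keep only runs of letters as words.
        words = re.findall(r"[a-z]+", doc.lower())
        counts.update(w for w in words if w not in STOP_WORDS)
    return counts

frequencies = term_frequencies(comments)
```

Calling `frequencies.most_common(3)` would surface the terms that recur most often, a simple stand-in for the patterns and trends that text mining tries to find.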
Conventional databases can be linked through middleware to the Web or a Web interface to facilitate user access to an organization's internal data. A user opens the corporate website in Web browser software, and the browser requests data from the organization's database by sending requests to the Web server over HTTP, typically from an HTML page or form. Since many corporate databases can't interpret these Web requests, the Web server passes them to middleware software that translates them into SQL so they can be processed by the DBMS. The middleware then transfers the results from the organization's internal database back to the Web server, which delivers them to the user in the form of a Web page.
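The request flow described above can be sketched in a few lines. This is only an illustration under stated assumptions: an in-memory SQLite database stands in for the corporate DBMS, the `products` table and its rows are invented, and `build_database` and `handle_request` are hypothetical names for the middleware step, not part of any real product.

```python
import sqlite3
from html import escape
from urllib.parse import parse_qs

def build_database():
    """Create an in-memory stand-in for the internal corporate database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE products (name TEXT, price REAL)")
    conn.executemany(
        "INSERT INTO products VALUES (?, ?)",
        [("widget", 9.99), ("gadget", 24.50)],
    )
    return conn

def handle_request(conn, query_string):
    """Middleware step: turn a Web request into SQL and the result into HTML."""
    params = parse_qs(query_string)
    name = params.get("name", [""])[0]
    # Parameterized query: the user's value is never spliced into the SQL text.
    rows = conn.execute(
        "SELECT name, price FROM products WHERE name = ?", (name,)
    ).fetchall()
    if not rows:
        return "<p>No matching product.</p>"
    # Escape values before placing them in the page sent back to the browser.
    items = "".join(f"<li>{escape(n)}: {p:.2f}</li>" for n, p in rows)
    return f"<ul>{items}</ul>"

conn = build_database()
page = handle_request(conn, "name=widget")
```

The query string plays the role of the browser's request, the `SELECT` statement is what the DBMS actually processes, and the returned HTML fragment is what the Web server would deliver to the user.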
5. Why are information policy, data administration, and data quality assurance essential for managing the firm's data resources?
Describe the roles of information policy and data administration in information management.
An information policy sets an organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information. It specifies who may share information, and where and when it may be shared, as well as laying out the procedures for how to do