A three layer based Intelligent Data Privacy Protection Scheme in Cloud Storage

ABSTRACT:

Unsupervised Cross-domain Sentiment Classification is the task of adapting a sentiment classifier trained on a particular domain (source domain), to a different domain (target domain), without requiring any labeled data for the target domain. By adapting an existing sentiment classifier to previously unseen target domains, we can avoid the cost for manual data annotation for the target domain. We model this problem as embedding learning, and construct three objective functions that capture: (a) distributional properties of pivots (i.e., common features that appear in both source and target domains), (b) label constraints in the source domain documents, and (c) geometric properties in the unlabeled documents in both source and target domains. Unlike prior proposals that first learn a lower-dimensional embedding independent of the source domain sentiment labels, and next a sentiment classifier in this embedding, our joint optimisation method learns embeddings that are sensitive to sentiment classification. Experimental results on a benchmark dataset show that by jointly optimising the three objectives we can obtain better performances in comparison to optimizing each objective function separately, thereby demonstrating the importance of task-specific embedding learning for cross-domain sentiment classification. Among the individual objective functions, the best performance is obtained by (c). Moreover, the proposed method reports cross-domain sentiment classification accuracies that are statistically comparable to the current state-of-the-artembedding learning methods for cross-domain sentiment classification.

PROJECT OUTPUT VIDEO:

EXISTING SYSTEM:

  • One popular solution to cross-domain sentiment classificationis to first project the source and the target features into the same lower-dimensional embedding, and subsequently learn a sentiment classifier on this embedded feature space.
  • This approach is particularly attractive when there is little overlap between the original source and the target feature spaces. Similarly distributed words in the source and the target domains get mapped to closer points in the embedded space, thereby reducing the mismatch of features in the two domains.
  • Prior work on cross-domain sentiment classification use unlabeled data from the source and the target domains to first learn a low-dimensional embedding for the two domains.
  • Next, labeled reviews in the source domain are projected onto this embedding.Finally, a binary sentiment classifier is trained using the projected source domain labeled training instances.

DISADVANTAGES OF EXISTING SYSTEM:

  • A limitation of existing two-step approach that decouples the embedding learning and sentiment classifier training is that the embeddings learnt in the first step is agnostic to the sentiment of the documents, which is the ultimate goal in cross-domain sentiment classification.
  • This method does not consider source domain labeled data during the PLSR step, which makes the projection agnostic to sentiment classification.
  • Supervised dimensionality reduction methods are prone to over fitting when the number of labeled instancesare small.

PROPOSED SYSTEM:

  • We propose an unlabeled cross-domain sentiment classification method using spectral embeddings where we project both the words and the documents into the same lower dimensional embedding.
  • The embedding learnt by our method enforces three important requirements.
  • First, a set of domain independent features (also known as pivots) areselected from the source and target domains which must be mapped as close as possible in the embedded space.
  • Second,friend closeness and enemy dispersion of the source domain labeled documents must be preserved. In other words, positively labeled documents must be embedded closer to each other and far from the negatively labeled documents. Likewise,negatively labeled documents must be embedded closer to each other and far from the positively labeled documents.
  • Third, within each domain, the local geometry among the documents must be preserved. For example,unlabeled neighbour documents in the source domainmust be embedded closer to each other in the embedded space whereas, unlabeled neighbour documents in the targetdomain must be embedded closer to each other in the embedded space. Here, neighbour documents refer to similar documents in terms of their text content.
  • We model each of the above-mentioned requirements as an objective function,and jointly optimise all three objective functions.

ADVANTAGES OF PROPOSED SYSTEM:

  • The proposed method can be easily extended to more than two sentiment classes.
  • Our experimental results on a benchmark data set formulti-domain sentiment classification demonstrate that by jointly optimising the three objectives in many cases we obtain better classification accuracies than if we had optimized each objective separately.
  • Even in cases where joint optimisation does not improve over the separately trained objectives, the performance obtained by the joint optimization method is never below that obtained by the best individually trained methods.
  • This result shows the importance of learning embeddings that are sensitive to the final task at hand, which is sentiment classification.
  • Moreover, the proposed method significantly outperforms several base lines and previously proposed embedding learning methods when applied to cross-domain sentiment classification.

MODULES:

  • System Model
  • Product Reviews
  • Search Users
  • Sentiment Reviews for Cross domains

MODULES DESCSRIPTION:

System Model

  • In this module, the Admin has to login by using valid user name and password. After login successful he can do some operations such as search history, view users, request & response, all topic messages and topics.This is controlled by admin; the admin can view the search history details. If he clicks on search history button, it will show the list of searched user details with their tags such as user name, searched user, time and date.In this module, the admin can view the all the friend request and response. Here all the request and response will be stored with their tags such as Id, requested user photo, requested user name, user name request to, status and time & date. If the user accepts the request then status is accepted or else the status is waiting.In this module, there are n numbers of users are present. User should register before doing some operations. And register user details are stored in user module. After registration successful he has to login by using authorized user name and password. Login successful he will do some operations like view or search users, send friend request, view messages, send messages, Tweet segmentation messages and followers.

Product Reviews

  • In this module, the admin can view the messages such as emerging products and their reviews. Tweet segmentation topic messages means we can send a message to particular user.
  • The main strategy of mapping the words and documents to the space is to first compute the word embeddings, and then derive the document embeddings based on the word embeddings by considering the word occurrences.

Search Users

  • The user can search the users based on users and the server will give response to the user like User name, user image, E mail id, phone number and date of birth. If you want send friend request to particular receiver then click on follow, then request will send to the user.
  • User can view the messages, send messages and send anomaly messages to users. User can send messages based on topic to the particular user, after sending a message that topic rank will be increased. Then again another user will also re- tweet the particular topic then that topic rank will increases. The anomaly message means user wants send a message to all users.

Sentiment Reviews for Cross domains

  • The constructed embedding space preserves the local connections between documents built upon the pivots and non-pivot words within each domain, meanwhile it aligns the source and target domains by matching the distributions of the pivots. This potentially enables the two structured spaces of source and target to be connected together by the pivot words. As a result, the enhancement of the class separation in the source domain will be propagated to the target domain. This thus achieves unsupervised sentiment classification in the target domain through the use of supervised information in source domain enabled by the cross-domain alignment driven by the pivot information.

SYSTEM REQUIREMENTS:

HARDWARE REQUIREMENTS:

  • System : Pentium Dual Core.
  • Hard Disk : 120 GB.
  • Monitor : 15 LED
  • Input Devices : Keyboard, Mouse.
  • Ram : 1 GB

SOFTWARE REQUIREMENTS:

  • Operating system : Windows 7.
  • Coding Language : JAVA
  • Frontend : JSP, HTML, CSS, JavaScript.
  • IDE Tool : Netbeans 7.2.1.
  • Database : MYSQL