manifoldcf-user mailing list archives

From Karl Wright <daddy...@gmail.com>
Subject Re: Not able to see results of sharepoint crawls?
Date Thu, 29 May 2014 17:03:37 GMT
Hi Lalit,

Have you added any metadata rules on the job's Metadata tab?

See
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#sharepointrepository
.
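
One way to check which metadata actually reaches Solr is to list the literal.* parameters in each /update/extract log line. In the logs quoted below, the only metadata parameter besides id and the security tokens is content_id, so a mapped Title or Name would presumably show up the same way once it is sent. The following is only a minimal sketch: the helper name literal_fields is hypothetical, and the sample string is shortened from the log excerpt in this thread.

```python
from urllib.parse import parse_qs

def literal_fields(params: str) -> dict:
    """Return the literal.* parameters of a Solr update query string.

    Keys are the Solr field names (the "literal." prefix is stripped);
    values are lists, because a field such as allow_token_document
    can be repeated. Values are URL-decoded by parse_qs.
    """
    parsed = parse_qs(params, keep_blank_values=True)
    prefix = "literal."
    return {
        key[len(prefix):]: values
        for key, values in parsed.items()
        if key.startswith(prefix)
    }

# Shortened from the params={...} part of a log line in this thread.
sample = (
    "literal.deny_token_document=Agrp:DEAD_AUTHORITY"
    "&literal.content_id={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}"
    "&literal.id=http://testirishwaterportal/sites/hr/Documents/Test%252011111.docx"
    "&resource.name=Test+11111.docx"
    "&literal.allow_token_document=Agrp:GApprovers"
    "&literal.allow_token_document=Agrp:GDesigners"
    "&wt=xml&version=2.2"
)

fields = literal_fields(sample)
for name, values in fields.items():
    print(name, values)
```

If a property mapped on the job's Metadata tab does not appear among the printed field names, it was not sent to Solr in that request.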

Karl


On Thu, May 29, 2014 at 12:32 PM, lalit jangra <lalit.j.jangra@gmail.com>
wrote:

> Thanks Karl,
>
> With your help, I am now able to get content indexed in my Solr, with logs
> as below showing meaningful values for literal.allow_token_document. But
> now I am struggling to get any properties indexed from SharePoint into
> Solr. On the MCF job page, I have added SharePoint content properties
> such as Name, Title, GUID etc., which are mapped to fields in my Solr
> schema, but I can see only the GUID property filled with metadata and
> none of the others.
>
> Can you help here?
>
> content_id={25CAEB55-ECEB-4ACC-A45F-78E35E877024}&literal.id=
> http://testirishwaterportal/sites/hr/Documents/A2.docx&resource.name=A2.docx&literal.allow_token_document=Agrp:GApprovers&literal.allow_token_document=Agrp:GDesigners&literal.allow_token_document=Agrp:GHR%2BMembers&literal.allow_token_document=Agrp:GHR%2BOwners&literal.allow_token_document=Agrp:GHR%2BVisitors&literal.allow_token_document=Agrp:GHierarchy%2BManagers&literal.allow_token_document=Agrp:GRestricted%2BReaders&literal.allow_token_document=Agrp:GViewers&literal.allow_token_document=Agrp:Uc%253A0%2528.s%257Ctrue&literal.allow_token_document=Agrp:Ui%253A0%2523.w%257Ciwater%255Cadministrator&wt=xml&version=2.2}
> {add=[http://testirishwaterportal/sites/hr/Documents/A2.docx
> (1469453279104598016)]} 0 64
>
> INFO  - 2014-05-29 17:10:51.533;
> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
> webapp=/solr1 path=/update/extract
> params={literal.deny_token_document=Agrp:DEAD_AUTHORITY&literal.content_id={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}&
> literal.id=
> http://testirishwaterportal/sites/hr/Documents/Test%252011111.docx&resource.name=Test+11111.docx&literal.allow_token_document=Agrp:GApprovers&literal.allow_token_document=Agrp:GDesigners&literal.allow_token_document=Agrp:GHR%2BMembers&literal.allow_token_document=Agrp:GHR%2BOwners&literal.allow_token_document=Agrp:GHR%2BVisitors&literal.allow_token_document=Agrp:GHierarchy%2BManagers&literal.allow_token_document=Agrp:GRestricted%2BReaders&literal.allow_token_document=Agrp:GViewers&literal.allow_token_document=Agrp:Uc%253A0%2528.s%257Ctrue&literal.allow_token_document=Agrp:Ui%253A0%2523.w%257Ciwater%255Cadministrator&wt=xml&version=2.2}
> {add=[http://testirishwaterportal/sites/hr/Documents/Test%2011111.docx
> (1469453279196872704)]} 0 63
>
>
>
> On Thu, May 29, 2014 at 12:28 PM, Karl Wright <daddywri@gmail.com> wrote:
>
>> Hi Lalit,
>>
>> deny_token_document being set to DEAD_AUTHORITY seems to imply you have
>> selected "active directory" as the authorization type for your connection.
>> (This is done on the Authority Type tab.)  But it may be the case that you
>> are using SharePoint in Claims-based mode.  If that's true, you should:
>>
>> - Select "Native" as the authority type
>> - Set up a SharePoint/Native authority
>> - If you have AD involved, also set up a SharePoint/ActiveDirectory
>> authority.
>>
>> FWIW, the two metadata values you want to watch in the Solr URL are:
>>
>> literal.deny_token_document=DEAD_AUTHORITY
>> literal.allow_token_document=
>>
>> You should see something in the allow field if your configuration is
>> right, and the SharePoint document is visible to anyone at all.
>>
>> You will also need to add appropriate fields in Solr for security tokens,
>> but I imagine you've already done that.
>> Thanks,
>> Karl
>>
>>
>>
>> On Thu, May 29, 2014 at 6:43 AM, lalit jangra <lalit.j.jangra@gmail.com>
>> wrote:
>>
>>> Hi,
>>>
>>> I have configured a job to crawl SharePoint with Apache MCF and store the
>>> index in Solr. When I run the job, it works fine without any error on
>>> screen or in the MCF logs, and it completes cleanly. I have a custom Solr
>>> schema that works fine for Alfresco.
>>>
>>> Now when I go back to the Solr admin screen to query the added documents,
>>> I am not able to see any SharePoint docs at all, and I see some deny_token
>>> details in the Solr logs. I have also tried to map/unmap properties from
>>> SharePoint to Solr in the MCF job screen, but to no avail.
>>>
>>> Can anyone help me here?
>>>
>>> INFO  - 2014-05-29 10:34:18.100;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract params={commit=true&wt=xml&version=2.2}
>>> {commit=} 0 95
>>> INFO  - 2014-05-29 10:41:16.555;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract
>>> params={literal.GUID={065F52D2-0192-43CF-B0FE-6B7DF4A25561}&literal.deny_token_document=DEAD_AUTHORITY&
>>> literal.id=
>>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID%3D3&resource.name=docname&literal.allow_token_document=&wt=xml&version=2.2}
>>> {add=[
>>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID=3
>>> (1469428768758038528)]} 0 3
>>> INFO  - 2014-05-29 10:41:16.576;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract
>>> params={literal.GUID={179F2C90-14A0-4097-A9F4-C0D2CD9D65B1}&literal.deny_token_document=DEAD_AUTHORITY&
>>> literal.id=
>>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID%3D2&resource.name=docname&literal.allow_token_document=&wt=xml&version=2.2}
>>> {add=[
>>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID=2
>>> (1469428768771670016)]} 0 11
>>> INFO  - 2014-05-29 10:41:17.004;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract
>>> params={literal.GUID={065F52D2-0192-43CF-B0FE-6B7DF4A25561}:IrishWater_-_ECM_-_High_Availability_Design.docx&literal.deny_token_document=DEAD_AUTHORITY&
>>> literal.id=
>>> http://sharepontsite/sites/hr/Lists/Announcements/Attachments/3/IrishWater_-_ECM_-_High_Availability_Design.docx&resource.name=IrishWater_-_ECM_-_High_Availability_Design.docx&literal.allow_token_document=&wt=xml&version=2.2}
>>> {add=[
>>> http://sharepontsite/sites/hr/Lists/Announcements/Attachments/3/IrishWater_-_ECM_-_High_Availability_Design.docx
>>> (1469428769214169088)]} 0 199
>>> INFO  - 2014-05-29 10:41:17.343;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract
>>> params={literal.GUID={39D4D9C1-301B-4082-94D7-818323509ABC}&literal.deny_token_document=DEAD_AUTHORITY&
>>> literal.id=
>>> http://sharepontsite/sites/hr/Shared%2520Documents/IrishWater_-_ECM_-_High_Availability_Design.docx&resource.name=IrishWater_-_ECM_-_High_Availability_Design.docx&literal.allow_token_document=&wt=xml&version=2.2}
>>> {add=[
>>> http://sharepontsite/sites/hr/Shared%20Documents/IrishWater_-_ECM_-_High_Availability_Design.docx
>>> (1469428769581170688)]} 0 171
>>> INFO  - 2014-05-29 10:41:31.555;
>>> org.apache.solr.update.DirectUpdateHandler2; start
>>> commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
>>> INFO  - 2014-05-29 10:41:31.662;
>>> org.apache.solr.core.SolrDeletionPolicy; SolrDeletionPolicy.onCommit:
>>> commits: num=2
>>>
>>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_9,generation=9}
>>>
>>>
>>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_a,generation=10}
>>> INFO  - 2014-05-29 10:41:31.663;
>>> org.apache.solr.core.SolrDeletionPolicy; newest commit generation = 10
>>> INFO  - 2014-05-29 10:41:31.669;
>>> org.apache.solr.search.SolrIndexSearcher; Opening Searcher@2cf5f14b
>>> realtime
>>> INFO  - 2014-05-29 10:41:31.670;
>>> org.apache.solr.update.DirectUpdateHandler2; end_commit_flush
>>> INFO  - 2014-05-29 10:41:49.886;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract
>>> params={literal.GUID={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}&literal.deny_token_document=DEAD_AUTHORITY&
>>> literal.id=
>>> http://sharepontsite/sites/hr/Documents/Test%252011111.docx&resource.name=Test+11111.docx&literal.allow_token_document=&wt=xml&version=2.2}
>>> {add=[http://sharepontsite/sites/hr/Documents/Test%2011111.docx
>>> (1469428803708125184)]} 0 55
>>> INFO  - 2014-05-29 10:41:49.982;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract
>>> params={literal.GUID={25CAEB55-ECEB-4ACC-A45F-78E35E877024}&literal.deny_token_document=DEAD_AUTHORITY&
>>> literal.id=
>>> http://sharepontsite/sites/hr/Documents/A2.docx&resource.name=A2.docx&literal.allow_token_document=&wt=xml&version=2.2}
>>> {add=[http://sharepontsite/sites/hr/Documents/A2.docx
>>> (1469428803806691328)]} 0 49
>>> INFO  - 2014-05-29 10:41:58.249;
>>> org.apache.solr.update.DirectUpdateHandler2; start
>>> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
>>> INFO  - 2014-05-29 10:41:58.424;
>>> org.apache.solr.core.SolrDeletionPolicy; SolrDeletionPolicy.onCommit:
>>> commits: num=2
>>>
>>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_a,generation=10}
>>>
>>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_b,generation=11}
>>> INFO  - 2014-05-29 10:41:58.425;
>>> org.apache.solr.core.SolrDeletionPolicy; newest commit generation = 11
>>> INFO  - 2014-05-29 10:41:58.433;
>>> org.apache.solr.search.SolrIndexSearcher; Opening Searcher@4f1ea922 main
>>> INFO  - 2014-05-29 10:41:58.435;
>>> org.apache.solr.core.QuerySenderListener; QuerySenderListener sending
>>> requests to Searcher@4f1ea922
>>> main{StandardDirectoryReader(segments_b:47:nrt _c(4.6):C4 _e(4.6):C1
>>> _d(4.6):C1)}
>>> INFO  - 2014-05-29 10:41:58.436;
>>> org.apache.solr.core.QuerySenderListener; QuerySenderListener done.
>>> INFO  - 2014-05-29 10:41:58.437;
>>> org.apache.solr.update.DirectUpdateHandler2; end_commit_flush
>>> INFO  - 2014-05-29 10:41:58.441; org.apache.solr.core.SolrCore;
>>> [collection1] Registered new searcher Searcher@4f1ea922
>>> main{StandardDirectoryReader(segments_b:47:nrt _c(4.6):C4 _e(4.6):C1
>>> _d(4.6):C1)}
>>> INFO  - 2014-05-29 10:41:58.444;
>>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>>> webapp=/solr1 path=/update/extract params={commit=true&wt=xml&version=2.2}
>>> {commit=} 0 195
>>>
>>>
>>>
>>> --
>>> Regards,
>>> Lalit Jangra.
>>>
>>
>>
>
>
> --
> Regards,
> Lalit Jangra.
>
