问题描述
任何人都没有有关如何为SQL Server 2005创建自定义分词系统的信息.我更喜欢用c#编写它.我需要能够搜索诸如'c#','f#'等之类的术语,但是'#'字符是英语(UK)分词系统组件中的分词系统,不能以任何其他方式更改. /p>
我发现以下文章提供了不完整的示例(缺少IWordSink接口),并引用了不再可用的文章.本文还没有提供我希望看到的任何线程检查代码. http://bytes.com/topic/sql-server/answers/864969-custom-wordbreaker-sql-server-全文
或者,有人可以指出我如何反编译现有的英语断字符程序"langwrbk.dll",这样我可以做一些小的改动,以获取所需的现有代码?
谢谢
柯克
断字和词干提取器的技术在所有Microsoft Search产品(包括SQL Server Fulltext)中都是通用的. Search SDK完整且完整记录,请参见扩展索引和 Windows搜索开发者的指南.
反向工程langwrbk.dll将违反您同意的用户许可,因为非常清楚地规定您不能对软件进行反向工程,反编译或反汇编.更不用说这是经过代码签名的,您将无法做出一些更改" ...
Does anyone have information on how to create a custom word breaker for SQL Server 2005. I'd prefer to write it in c#. I need to be able to search on terms such as 'c#', 'f#' etc but the '#' character is a word breaker in the English (UK) word breaker component and can't be changed in any other way.
I have found the following article which provides an incomplete (the IWordSink interface is missing) sample and references an article that is no longer available. This article also doesn't provide any of the thread-checking code I'd expect to see.http://bytes.com/topic/sql-server/answers/864969-custom-wordbreaker-sql-server-full-text
Alternately, someone could point me to how to decompile the existing English word breaker 'langwrbk.dll' so I could make the little change I need the existing code?
Thanks
Kirk
The technology for word breakers and stemmers is common across all the Microsoft Search products, including SQL Server Fulltext. The Search SDK is well and fully documented see Extending the Index and Windows Search Developer's Guide.
Reverse engineering the langwrbk.dll would be against the user license you agreed on, as is very clearly stipulated that you cannot reverse engineer, decompile or disassemble the Software. Not to mention that is code signed and you wouldn't be able to 'make a little change'...
这篇关于SQL Server全文的自定义WordBreaker的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!